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1 Introduction 



1.1 Pointwise ergodic theorems along sparse sets 

Subsequence ergodic theorems have been related to interesting questions since the beginnings 
of the subject: for instance, the equidistribution of {n 2 a} for a ^ Q corresponds to the mean 
convergence of the averages along the squares, 



The pointwise theory of nonstandard ergodic averages has an eventful history marked by a 
combination of "qualitative" methods, characterized by an interest in infinitary properties such 
as convergence, and "quantitative" techniques, which spring from a more hard-analytical back- 
ground. In the qualitative direction, Bellow and Losert [2] were the first to construct sparse 
sequences {n^} for which pointwise ergodic theorems could be proved; these sequences consist 
of increasingly large blocks of integers separated by increasingly, but not arbitrarily, vast gaps. 
The pointwise convergence (almost everywhere) of these averages followed from covering lemmas 
and asymptotic properties of the averaging sets. 

A seminal quantitative result was Bourgain's proof [5] that the averages along the squares 
(II. ip converge pointwise (almost everywhere) as N — > oo for any dynamical system (X, T) and 
any / E L 2 (X). This result, and its many subsequent variants, used a transference argument 
and the Fourier-analytic properties of corresponding exponential sums to prove maximal and 
oscillational inequalities for the averages. 

One distinction between the two methods concerns the endpoint space L^X). The methods 
applied in [2] automatically prove convergence for all / G L < , but the Fourier transform methods 
for various sequences could be extended to L p , p > 1, only by interpolation. As a result, the L l 
case for the sequence of squares remained open- for two decades, the sparsity of block sequences 
remained the only kind of sparsity for which L 1 pointwise ergodic theorems could be proved. 

In recent years, there has been substantial progress on the L 1 pointwise ergodic theory for 
sparse averages via a technique of Fefferman [11] and Christ [8j which arises from the theory 
of singular integrals. Urban and Zienkiewicz [23] used this method to prove an L 1 pointwise 
ergodic theorem for the averages along \n a \ (for a slightly greater than 1), and LaVictoire |15j 
proved a similar pointwise theorem for random sequences: specifically that averages taken along 
a sequence of integers chosen randomly, and nearly as sparse as the sequence of square numbers, 
must converge pointwise almost surely for / € L . 

However, Buczolich and Mauldin [7] proved that the averages along the squares (jl.ip do not 
satisfy an L 1 pointwise ergodic theorem; this argument is fundamentally a qualitative proof but 
uses some quantitative results from number theory. This result, among others, demonstrates 
that the L 1 case is distinct from, and requires different approaches compared to, the LP case, 
p>l. 

Finally, in [10], Christ has provided new deterministic examples of sparse sequences that al- 
low for pointwise convergence results in L 1 . These sequences are constructed from the sets 
{(a, a 2 , . . . , a n ) C Z" : a £ ^p}, using Freiman isomorphisms to find sequences of integers which 




(1.1) 



k=l 
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have similar Fourier properties. The Weil bounds on the corresponding exponential sums give 
optimal Fourier bounds on the averages along such a sequence; these optimal Fourier bounds 
allow the application of another version of the technique of Fefferman and Christ, which obtains 
a weak (1,1) maximal inequality. 

1.2 Pointwise ergodic theorems over discrete groups 

Another classically interesting extension of ergodic theory concerns more general averages ob- 
tained from a measure-preserving action T of a discrete group G on a probability space X. 
To each g E G we associate a measure-preserving transformation T(g) on X, such that T{g) o 
7~(</) = T(gg')- If G = 7j d , in particular, a measure-preserving group action consists of d 
commuting transformations (Ti, . . . , T^); for each ft E we have the transformation T(n) = 

rpn\ i-nna 
1 1 " ' 1 d ■ 

Tempelman [22] obtained a fairly general pointwise ergodic theorem (Theorem 12. ip for group 
actions, defined with respect to a sequence of finite subsets Fn C G. We consider the averages 



If the sequence of sets {i*jv} satisfies the Tempelman condition (|2.ip . then for any / E L l (X), 
Apff will converge pointwise for every measure-preserving action T, irrespective of the under- 
lying probability space X. 

There is a useful class of discrete groups which we will consider in particular: 

Definition A finitely generated group G is virtually nilpotent if it contains a nilpotent subgroup 
of finite index. 

If we consider a finite set A of generators of a group G, and let denote the words of length 
< N in that alphabet, we have that these virtually nilpotent groups are precisely those in which 
#A N grows at a rate polynomial in N (see [16], [26], and [12]). This polynomial growth rate 
allows for interesting applications of both qualitative and quantitative techniques in the cases 
of block sequences and random sequences, respectively, ultimately resulting in new pointwise 
convergence results. 

1.3 Sparse averages over discrete groups 

Combining the qualitative and quantitative approaches, we study sparse ergodic averages in the 
group setting and prove L 1 pointwise convergence results in a number of contexts. 

By sparse, we refer to a quality of a set, taken from a group G, analagous to that of being 
zero density in the integers. In the integers, density is determined by comparing a set to nested 
sets of closed intervals. When we consider virtually nilpotent groups, however, there are more 
choices for sets to which we can compare ours than in the case G = Z. The sets defined by 
word-length in the generating set prove to be a good choice; by a result of Pansu [T7], the cor- 
responding notion of sparseness is independent of our choice of generating set. 

In addition to this basic idea of sparseness, we will also consider other descriptions of the 
density of a set. The first is an extension of the idea of upper Banach density; the second is an 
extension of the condition that gaps tend to oo. 
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Definition Let F be a subset of a virtually nilpotent group G, and A a finite generating set. 
We say F is sparse if 

# (F n ) 

lim JlAiV =0 - 

7V->oo #A^ 
We say that F has Banach density if 

hm sup —j-r = 0. 

Finally, we say that F has gaps tending to oo if, for every N, there are only finitely many g G F 
such that gA N f]F ^{g}. 

We will exclude from our consideration sets that are sparse by virtue of essentially being averages 
over a subgroup of G- for instance the sets {1, . . . , N} x {0} C Z 2 - since these are classical group 
averages in disguise. That is, we will be looking for sequences of sets Fjy such that if one takes 

oo 

any sequence of subsets C F/v with lim sup ^tP F N = 0, then (J F u \Dn still generates the 

N^oo ff n JV=1 

entire group G. 



1.4 Results 

We have three main categories of result, each obtained by a different method. Furthermore, in 
the case G = Z d , each of these has two variants. Colloquially, we call a construction plaid if it 
consists of a Cartesian product of sparse subsets of Z, and we call it native if it is built in a 
genuinely d-dimensional manner. It should be noted that the results in the plaid case are still 
nontrivial, since the averages do not factor. (The associated Fourier transforms do factor, but 
this helps only in 1? and not in L .) 

The first category of results, which are discussed in Section [2] for Z rf and Section \5. II for virtually 
nilpotent groups, extends the sparse block sequence result of Bellow and Losert [2] by show- 
ing that appropriate sparse sequences of sets satisfy the Tempelman Condition (|2.ip and thus 
Tempelman's Ergodic Theorem (Theorem 12. ip applies. These sets are sparse, but do not have 
Banach density 0. 

The second category, developed in Sections [3] for 7L d and 15.21 for virtually nilpotent groups, ex- 
tends the sparse random sequence result of LaVictoire [15]. For simplicity, we will state here 
the speckled Z rf version only. Let : n G Z d } be independent {0, l}-valued random variables, 
such that if 2? < \n\ < 2^' +1 , then P(f s = 1) = w |n|~ 7 . 

Then for 7 < d/2, with probability 1, the random set generated by satisfies an L 1 pointwise 
ergodic theorem: 

Theorem 1.1. For 7 < d/2, the following holds for all 00 G Q except for a set of probability 0: 
Let {dk}k<^n be an enumeration of the set {n : ^(w) = 1} with \3k\ increasing. Then for any 
measure-preserving X d -action T and any f G L l (X), the averages 

1 N 

A N f(x) :=-^/oT(a fc )(*) 
fc=i 

converge almost everywhere in X. 
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Additionally, we prove a plaid version on 7L d (Corollary 13 . 5 j) and a version on general groups 
(Theorem I5.5|) . The latter requires us to first prove an I? ergodic theorem (Theorem 15. 4|) since 
the Fourier transform methods which work for Z d do not work on nonabelian groups. An argu- 
ment based in combinatorics and the TT* method from harmonic analysis suffices instead. 

With probability 1, these random sequences are not only sparse, but have Banach density 
0. Further, each can be modified to a sequence whose gaps tend to oo and along which an L 1 
pointwise ergodic theorem still holds (Section I5.3H . 

The third category of result, developed in Section [H extends the sparse deterministic result 
of Christ [TO] to Z rf with both a native version (Theorem 14. 4|) and a product version (Theorem 
I4.7p . We will state the simplest form of the first version. 

Theorem 1.2. Let p^ be prime numbers with 2 k < p^ < 2 fe+ 3 . Then if we take the sparse set 

oo Pfe-l 

S={J U( 2fc+1 + [ikJj 2 kr--,[/k) 

k=l j=0 

and order it by first coordinate, the averages along this set converge (almost everywhere) for 
every measure-preserving TJ 1, -action T and any f G L 1 (X). 

The a.e. convergence in this result is new even for the original averages in Z; in [10] only the 
weak (1, 1) maximal inequality is proved. The existence of a dense class for which pointwise 
convergence holds follows from an oscillational inequality (Theorem I4.3P for the averages, as 
in Section IV. 2 of Rosenblatt and Wierdl's monograph, [18]. The pointwise convergence for L 1 
functions follows. These sequences are sparse as well, and can be modified into a sequence whose 
gaps tend to oo without affecting the a.e. convergence of the averages. 

Acknowledgments 

The authors thank M. Christ for substantial help on the results in Section 4, and for dissemi- 
nating the preprint |10j . 



2 Averages Along Blocks 

A sequence of sets {F n } taken from a discrete group G is called a F0lner sequence if each set is 
finite and 

lim #(^AF„) =() 

for every g G G. 



A F0lner sequence is said to satisfy the Tempelman Condition if there is a constant C so that 

# {F^Fn) < C#F n . (2.1) 

Definition Let (X, J 7 , m) be a probability space and {T g : g G G} a group of measure-preserving 
transformations on X with TgT^ = T g h for all g,h G G. We say that {T g } is a measure-preserving 
group action. 
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Theorem 2.1 (Tempelman's Ergodic Theorem, [22 J. Suppose that T is a measure-preserving 
action of G on the space X and that {F n } is a nested F0lner sequence that satisfies the Tempel- 
man Condition. Then the averages 

converge for a.e. x € X. 

In this section we seek a generalized form of the block sequence example of Bellow and Losert [2] . 
We will examine both a plaid and a speckled version: our method, in each case, will be to show 
that a suitably chosen F0lner sequence satisfies the Tempelman Condition, 12.11 We note here 
that while each construction in this section will have zero density, neither are of Banach density 
zero: much like the original construction, the existence of the block-like structures does not allow 
for this type of sparseness. Further, in the first construction, we consider only two-variable free 
actions. The ideas presented, however, will work in any finite number of variables. 



2.1 A Plaid Construction 

In this construction, our aim is to build a sparse sequence of sets by considering the products 
of one-dimensional sparse sequences. Among the difficulties in two dimensions is the loss of 
a natural order in which to take our averages; hence we must not only define our larger set, 
whence we derive sparseness, but also an ordering of its elements. This ordering will give us our 
sequence of sets. 

We start with a sequence of rectangles {Dk} in I?, where each Dk = Rk + (uk,Vk) and Rk = 
[I, ak] x [1,6^]. We will need to have these rectangles well spaced, so we will assume that 

Uk+i >u k + a k and v k+ i > v k + b k . (2.2) 

We also want to arrange for these rectangles to provide us projections along the axes that give 
sequences of zero density. So we will also assume that 

fe-l k-\ 

£ * £ h o 

— ► and ► as k -> oo. (2.3) 

0>k bk 

It follows from Equation 12.31 that a k — > oo and bk —> oo as k — > oo. 

An additional regularity assumption will be necessary in order to to prove almost everywhere 
convergence results. It is that for some constant C > 0, we have 

a.k > Cuk-i and b k > Cv k -i- (2.4) 

This assumption is one of the conditions used by Bellow and Losert in [2], and by Bellow, Jones, 
and Rosenblatt in [3]. 

We want to use this sequence of blocks to form unions of their projections on the coordinate 

axes, and then form two variable averaging operators by putting these back together. We denote 

the union of the projections of the blocks {D\, . . . , Dk} on the coordinate axes by A(k) = 
k k 

|J [ui + 1, Ui + Oj], and B(k) = \J[vi + 1, Vi + &»]. We want also to work with the intermediate 

i=l i=l 

blocks that come from the projections, so for each k and r = 1, . . . , a/j, we denote by A(k, r) = 
A(k - 1) U [u k + l,u k + r], and B(k,r) = B(k - 1) U [v k + l,v k + r]. Note that A(k,a k ) = A(k) 
and B(k,b k ) = B(k). 
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The first fact to observe from [3], is that the sequences of sets {^4(/c)} and {B(k)} satisfy the 
Cone Condition: thus the operators Akf = — Y f ° S Uk+l and B k f = x~ f T'" k+:i have 

" 8=1 j=l 

maximal functions that are weak (1, 1) and strong (p,p) for all p, 1 < p < oo. Since a k — > oo and 
b k —> oo, we get pointwise and norm convergence to the projection of the invariant functions for 
all L p , 1 < p < oo in each case. 

Also, as implicit in [2] without proof, the intermediate sets {A(k,r) : k > 1, 1 < r < a k } and 

{B(k, r) : k > 1, 1 < r < 6^.} are F0lner sets that satisfy [2TT1 (see Proposition I2.2p . In particular, 

there is a constant such that #(A(k,r) - A(k,r)) < C#A(k,r) and #(B(k,r) - B(k,r)) < 

Cj£B(k,r) for all k and r. As a result, the operators A(k,r)f = ^ A ^ k r ) f ° S l and 

ieA(fe,r) 

B(k,r)f = 4±^/ k r \ f °Ti have maximal functions that are weak (1, 1) and strong (p,p) 

' jeB(k,r) 

for all p, 1 < p < oo. (See Tempelman, [22]). So again we get pointwise and norm convergence 
to the projection of the invariant functions in each case, for all L p , 1 < p < oo. The pointwise 
convergence result is clear from the Cone Condition if one were restricting oneself to the ends 
of the individual block (i.e. using only A(k) = A(k,ak) and B(k) = B(k,b)-)). But this result 
actually requires the following computation even for these sequences, let alone the full sequence 
of intermediate sets. We give this proof because it does not appear in [2] and there are some 
not necessarily so obvious details that do need to be considered. 

Proposition 2.2. The sequences of sets {A(k,r)} and {B(k,r)} satisfy the Tempelman's regu- 
larity conditions. 

Proof. We will consider only {A(k,r)} since the argument for {B(k,r)} is similar. Our sets 
are all finite. Also, any A(k,r\) C A{k + 1,^) if 1 < r\ < k and 1 < r<i < k + 1, and 
A(k,r\) C A(k,r2) if 1 < r\ < r% < k. Because these sets are unions of blocks [m + l,Ui + Oj], 
and an intermediate block, with the lengths a% — > oo as i — > oo, it is clear that the sequence 
(A(k, r)) satisfies the F0lner condition. So the only condition remaining is the fact that there is 
a constant M so that #(A(k, r) — A(k, r)) < M#A(k, r) for all k and r. 
Here we have 

k-l k-l 

A{k, r) - A{k, r) = (J A* U [u k + 1, u k + r] - (J Ai U [u k + l,u k + r] 

i=l i=l 

/k-l k-l \ /k-l \ 



(J Ai - (J Ai U |J Ai - [u k + 1, u k + 



\i=l i=l / \i=l / 

U i^[u k + l,u k + r] - |J A-^j U [u k + l,u k + r]- [u k + l,u k + r]. 

So, as for the size of the set of differences, we are left with 

/k-l k-l \ /k-l \ 

#(A(k,r) - A(k,r)) <# \J A - \J aA + 2# ( (J ^ - [u k + l,u k + r] + 2r - 1. 

\i=i i=i / \i=i / 



fe-i 

By Lemma 3.2 of [2], we know that (J satisfies all of the requirements of Tempelman's 

8=1 

Theorem. In particular, this means there is then a constant M\ so that 

(k-l k-l \ /fe-i \ fc-l 

|J A - |J Ai < Mi# |J A = Mi J^ai. (2.5) 
t=l i=l J \i=l J i=l 
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Combining the assumptions above with our estimate, we then find that there is a constant M2 
so that 

/k-l k-l \ 

# U Ai - U Ai ^ M 2a k -L (2.6) 
\i=i i=i J 

/k-l \ 
We now will turn our attention to # I |J Aj — [Uk + 1, "Ufe + r] I . 



fc-i 

[J A; - [ujt + 1, u fc + r] 
i=l 

/fc-2 \ 

= I |J A - [u k + 1, ti fc + r] I U (A fc _i - [itjfc + 1, tt fc + r]) 

C [-ujt - r, ti fc _ 2 + a fc „ 2 - Ufc - 1] U - u k -r, u k _i + a fc _i - 

Looking at the lengths of these intervals, we find that 

# ^ |J Ai - [n fc + 1, u fc + r] j < n fc _ 2 + a fc _ 2 + a k ^ + 2r + 1. 



But from our requirement relating the growth of (u k ) and (a k ), we know that there is a nonzero 
constant M3 so that 

Uk-2 + CLk-2 < M 3 a k -i- 



So we have that 

# ^(J A t - [u k ,u k + r]j < (M 3 + l)a fc _i +2r + 1. (2.7) 



Combining the inequalities 12.61 and 12. 7[ we find that there are constants M4 and M5 so that 

# (A(r, k) - A(r, k)) < M 4 a fe _i + M 5 r. 

Now, suppose r < a k -\. Then we have that 

# (A(fc, r) - A(fc, r)) < (M 4 + M 5 ) a fc _i. 
But #A(k,r) must be more than <2fc_i since A(fc, r) contains the k — 1 block. So 

#(A(fc,r)-A(fc,r)) < (M 4 + M 5 )#A(k,r). 



If, on the other hand, we have r > Ofc_i, then 

# (A(k, r) - A(k, r)) < (M 4 + M 5 ) r. 
But #A(k, r) is more than r because r) contains the set [v,k + 1, + r]. So again 

# r) - A(k, r)) < (M 4 + M 5 )#A(A;, r). 

□ 

We now want to put the sequences {A(k,r)} and {B(k,r)} back together. Suppose we write 
(J v4(/c,r) = {s m } and |J B(k,r) = {t n } where s = {s m } and t = {t n } are increasing 

(fc,r) {k,r) 

sequences. Then take commuting maps S and T and consider the two variable averages 

M N 

m=l n=l 

Proposition 2.3. The sequence {^4(at 5 at)} has a maximal function that is weak (1, 1) and strong 
(p,p). Hence, for an ergodic 1? action, for all f G L l (X), we have Jim A^,N)f{. x ) = fx f dm. 

Proof. Each A^ N ^ N ^ corresponds to an average over a set of the form A(ki,r±) x B(k2,V2)- 
Because we have taken M = N here, these sets are nested. Also, these sets satisfy the Folner 
condition. So, we obtain our result from Tempelman's Theorem |22] if we have a constant C Q 
such that 

rtiAik^n) x B(k 2 ,r 2 ) - Aikun) x B(k 2 ,r 2 )) < CoftiAikun) x B(k 2 ,r 2 )) . 

But 

#(A(fci,n) x B(k 2 ,r 2 ) - A(k u n) xB(k 2 ,r 2 )) 
= #((A(k 1 ,n)-A(k 1 ,r 1 )) x (B(k 2 ,r 2 )-B(k 2 ,r 2 ))) 
= #(A(kx, ri ) - A(fci,n)) #(B(k 2 ,r 2 ) - B(k 2 ,r 2 )) 
< C 2 #A(A; 1 ,r 1 )# J B(A ;2 ,r 2 ) 
= C 2 #(A(k 1 ,r 1 ) xB(k 2 ,r 2 )). 

□ 

The result above is gives a two variable ergodic theorem with averaging over sets that have 
density zero along all horizontal and vertical lines in Z 2 . 

We would like to turn this block method into a sequence method as appears in Theorem 1 1 . 1 1 and 
Corollary 13.51 But we do not yet know how to amalgate enumerations of the supports of Aw m 
to achieve this. 

We would like to know more about integrability of the maximal function sup |^4(jvjv)/I- But 

N>1 

first we have this basic question. 

Question: Do the maximal functions of A(k,r)f and B(k,r) map LlogL to L 1 ? 

It seems plausible that the answer to this question is affirmative because of the following fact: 
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Proposition 2.4. The maximal functions for Akf and Bkf map LlogL to L 1 . 

Proof. We prove this result just for A* k f ', the maximal function of Akf, because the proof for 
Bkf is identical. We refer to the notation and argument in the proof of Theorem 1, (a) in [3]. 

Here C denotes an absolute constant, but not necessarily the same constant throughout. Let 

k 

f* = sup 2k+i ^ \f{T^ x )\i a two sided version of the classical maximal function. The classical 

fc>oo i=— fe 

result for / G LlogL gives the inequality ||/*||i < C J x |/|(1 + log + |/|) dm. 
Given the Cone Condition, there is a constant C such that for all (f> E Zi(Z), 

#{j : A* k <P(j) > 2A} < C#{j : cf>*(j) > A}. 

This can be seen, with a slight change of notation so as to be consistent with the notation here, 
by using the (-B,), defined on p. 45 in [3], and the inequality if{j\M^(j){j) > 2A} < C \ U on 
p. 46 in 0. 

By Calderon's transfer principle, this gives 



But then we have 



m{x : A* k f(x) > 2A} < C m{x : f*(x) > A}. 



\\Atfh < Cj2m{x:A* k f>2n} 

n=l 



< c m i x ■ f* > n ) 

n=l 

< c\\r\\t 

< C f 1/1(1 + log + |/|) dm. 

Jx 



□ 



Remark: More generally, one might ask how the maximal functions of A(k,r)f and B(k,r)f 
behave on general Orlicz spaces. In particular, for which Orlicz functions and ^2 do these 
maximal functions map ^ / i(L) to ^(L)? 

Question: Does the maximal function of (Ar^mf) map Llog^toL 1 ? This seems the correct 
choice of the domain for this result because the maximal function over the first variable should 
be mapping L log 2 L to L log L. 

To obtain pointwise convergence results on L l (X), we needed to restrict our two variable averages 
A(M, N) to just using Arwrnf. While any nested sequence of rectangles would serve as well, 
there is a good reason to avoid a sequence of sets in which the side lengths are unrelated. 

Proposition 2.5. Suppose S and T commute and generate a free ergodic action. Then there 
exists a function f G L l (X),f > 0, such that sup ^(m,/^)/ = 00 a - e - 

(M,N) 

Proof. Otherwise, by Sawyer's principle (see Sawyer, [E]), there is a weak inequality 

C 

m{x £ X : sup \Ar M mf \ > A} < — 

(M,N) A 
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for all / G L l {X). Now for G ^(Z 2 ), let 

j M AT 
m=l n=l 

Because the action is a free ergodic action, our weak inequality transfers to a weak inequality of 
the form 

C, 



(M,N) 



CL 



#{(i,j) G -L 1 : sup \A {MN) </>(i,j)\ > A} < , 
(M,N) A 

for all G £ 1 (Z 2 ). Now take the function (j) = <5(o,o)- We then consider the set E n = {(s r ,t s ) : 
rs < n}. Given any (i, j) G E n , we have sup ^4(Af,7V)^(^ j) > > ~- Hence, with A = ^, we 

have 

C 

I/V 

> #{(t,j) G Z 2 : sup > -} 

m<n y ' n 

> #E n 

= #{(sr,t s ) :rs<n} 

> era log n. 

Letting n tend to infinity gives a contradiction. □ 

Remark: The divergence of the maximal function precludes there being a pointwise a.e. con- 
vergence result on L l (X). Also, notice here that it is not necessary to transfer the inequality, 
just cleaner to state the idea. One could just work with large square Rokhlin towers of height 
and width ra constructed within X and use f = 1b, where B is the base of the Rokhlin tower. 
A similar argument to the last string of inequalities will lead to a contradiction as n — > oo. 

Suppose now instead that we have an Orlicz space ^f(L) with ^ some regular Orlicz function. 
Both Stein |20j and Sawyer |19] give useful results concerning maximal inequalities for Orlicz 
spaces. We will need to have a regular Orlicz function i.e. one such that H^d/DHi and the 
norm are proportional to one another. Besides the usual properties of Orlicz functions, 

regularity means that for some constant K, we have *$>(2x) < K*$>(x) for all x > 0. 
We can use the results of |19] to prove the following. 



Proposition 2.6. Suppose S and T generate a free ergodic commuting action. If 

sup \A, M mf\ < oo 

(M,N) 

for all f G ^(L), then there exists a constant C such that for all f G ^(L), we have for all 
A > 0, 

m{x: sup |A (A , W /|>A}<C||M/(i|/|)|| 1 . 

(M,N) A 

Otherwise, for a residual set of functions f G *&(L), we have sup \A(M,N)f\ = oo a.e. 

(M,N) 



Proof. See Theorems 3 and 4 in [19]. □ 
This gives the following 
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Proposition 2.7. Suppose S and T generate a free ergodic commuting action. Suppose that 
is a regular Orlicz function such that fy(x) = o(xlog + x) as x — > oo. Then there exists f £ ^(L) 

such that sup \Ai M N \f \ = 00 a.e. 

(M,N) 

Proof. By Proposition 12.61 if such an / did not exist, then we must have the weak inequality in 
the proposition. So take the function / = iVT^ where B is the base of a square Rokhlin tower 
of height and width n given by the free action determined by S and T. We let A = — . Then we 
have 



C*(n)m(B) = C||tt(Ll B )||i 

1 

> m{x : sup Af MN )l B > -} 

(M,N) n 

> cnlognm(B). 

Letting n — > oo gives a contradiction. □ 

Remark: There is perhaps of generalization of the above to other types of averaging besides 
Cesaro averaging. Indeed, suppose (hm) and (z^jv) are two uniformly dissipative averaging 
methods on Z. Consider the two variable average Ai^^rnf = Yl Yl / u m("T')^jv(^)/ S m o T n . 

We conjecture that for a free ergodic commuting action, there would exist a function / > such 
that sup Af M ^ N \f = oo. a.e. As for positive results in this generality, we do not have results 

(M,N) 

at this time. 



But in any case we do know that on L log L the averages A.(m,n) are wen behaved. 

Proposition 2.8. On LlogL, and hence on any L P (X),1 < p < oo, the averages A(M,N)f 
converge a.e. for all f. 

It is clear that the positive results above can be naturally extended to actions of 7L d with 
d > 3 too. There will be analogous issues though on which Orlicz spaces are best to use when 
considering unrestricted mutlivariable averages. 



2.2 A Divergent Construction 

In the following section B r will denote the ball of radius r in Z rf , and B^~ will denote that part 
of the ball consisting of elements all of whose coordinate entries are positive. Similarly, 

7L% = {(ai,a 2 ,...,a d ) £ Z d \ ai >0 for all i > lj . 

Let Ik be a sequence of rectangular prisms in with one corner at the origin, each having 
diameter 1^. We also require that the dimensions b\, 62, &fe of each prism satisfy 

c<^<C 

where c and C are positive, absolute constants. 



Suppose these diameters satisfy 

•4+1 
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as k — > oo. 



Further, suppose we have a sequence of vectors, 3k £ so that 

\Sk+i\ > \a k \ + 4, and 
£ k > C \dk-\\ , 

where C is some constant independent of k. 
Let S = U k {Sk + h)- 

We might like to add points to our average in an order depending only on their distance from 
the origin, i.e. use the sets {S D B r : r > 1}. However, since the ball of radius r is very "flat" 
in the directions of the coordinate axes, we will either have to use a different ordering or place 
restrictions on the locations of blocks. 



Take, for example, the set in Z 2 , S = (a& + Ik), where, in addition to the requirements above, 
we have that 

1. Ik are squares with diameters Ik, 

2. i\ < \a k \, 

3- f 4+1 >(£•=! ^ ? )%nd 
4. a k = (a k ,0). 

Define the subsequence j S n B Tk : r k = \J\3k\ 2 + |^|| • This sequence consists of sets comprised 
of all blocks before the kth, and the left face of the kth. block. 



Define f k (x,y) : J? ->• K by 

fk(x,y) 



' \ for x = a k , < y < &t 
otherwise. 



and let f(x,y) = £ fe /fcO,y). 

We then have that the average over {S n B Tk }, 



1 n, N ^4 

#SHB rk , 2s J[X,y) > (iyk-i f2 \ , 

^ rfc (x, y )eSnB rk I 2 Z^=i *i J + ^ 



while the average taken at the ends of each block 



1 



#SnB r . *s Jy ' a ' £2 

The first average is larger than 1/2 while the second tends to 0; each may be transferred to our 
measure preserving system. This answers in the affirmative the question of whether there is a 
sequence of sets for which a subsequence converges while the larger sequence diverges. 
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This example illustrates the interaction between the metric of the group and the measure of the 
various blocks; it works becuase the additional requirements above allow us to slice off a single 
face of the block at a time, and the growth requirement allows for these faces to outweigh the 
measure of the previous blocks. 

Removing either of these conditions results in a pointwise theorem. For example, suppose we 
required our a k to not lie along an axis. Suppose that our set S is constructed in accord with 
the earlier requirements alone; that is without conditions [T] through [H We will require instead 
that 

a k = (a k ,a k ,...,a k ). (2.8) 

Proposition 2.9. With requirement Ii2.8\) , {S fl B r : r > 1} is a pointwise L l -good sequence of 
sets for any aperiodic Z d action. 

Proof. As in Bellow and Losert's proof for the block sequence, we will rely on the Tempelman 
Ergodic Theorem to complete the proof. The first three requirements are filled- the sets are all 
of finite volume, each is nested within the next, and the F0lner property is satisfied through the 
lacunary growth of the diameters £ k . All that remains is the difference requirement. 

Here we divide our set S n B r into two: a S m = U^L 1 (a k + 1^), including everything up to the 
last complete prism in S, and a remainder R, consisting of everything else. The crucial difference 
between this and the earlier example is that the remainder R must be either trivial or properly 
d-dimensional. 



Since 

s n B r = s m u R, 

we have 

#(5n5 r -5flB r ) < #{S m -S m ) + #{R-S m ) + #{S m -R) + #{R-R) 

= #(S m - S m ) + 2# (R - S m ) + #(R-R). (2.9) 

We will deal with the first term first, noting that this would complete the proof of our claim 
were we only interested in the subsequence consisting of sets with only complete prisms. 

In a similar fashion to the decomposition in Section 15.11 we may write 

Sm — \P>m I-m) U S m —\. 

We then note that 

# (S m — S m ) < # {(S m + Im) ~ (4 + Im)} + 2# {(S m + Im) ~ S m -l} + # {S m -1 ~ S m -l} ■ 

For the first term, we have 

# {(a m + Im) ~ (S m + I m )} = # (I m - I m ) < # (B lm - B e J < #B tm < C#I m , 
For the third term, we have that S m -i C ^|a m _i|+^ m _i i thus the difference 

S m -1 - S m -1 C -B|a m _i|+^ m _i _ ^|3m_i|+< m -U 
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which, in turn, is contained in -B|a m _i|+£ m _i • The volume of this ball is a constant multiple of 
(|a m _i| +£ m -i) d . But by our second condition on the spacing sequence above, 

(\a m -i\ + l m -i) d < (-^£ m + £ m -i 

whence we have that this difference is less than C d £ d n for some constant C. 

For the second difference, we note that shifting a set in Z d does not change its measure. So, 

# {(4 + Im) — S m -i} = # {3 m + I m — S m -i} 

= #{-a m + a m + I m - S m -i) < # {B im - jB|s rn _ 1 | + £ m _ 1 } < #S| Sm _ 1 | + £ m _ 1 , 

which, as before, has a volume less than C d £ d n for some constant C 
So we have that the difference # (S m — 5 m ) < C#S m . 

The size of the second and third terms in 12.91 are dependent upon the diameter of R. Defining s 
to be the diameter of R, we note that s < £ m +i and the measure of #i? > Cs d , by our orignial 
conditions on the block I m +i and requirement (|2.8p . 

For the third term, we note that 

R- R Q (a m+ i + B s ) - (a m+1 + B s ) . 

Thus we have that 

#(R-R)<#(B S - B 8 ) 
= #B S 

< Cs d < c#R, 

where C and c are different constants. 

This leaves us with only the second term. Here we have 

#(R-S m )<#(B s -B ce J 

< C max(s d , (c£ m ) d ) <C#(Sfl B r ) 

for constants c and C. 

□ 

Remark: Taking another approach, we might instead place an additional requirement on the 
diameters, £k, relative to the spacing vectors, 3k, in order to insure that the first set which might 
outweigh the previous blocks would necessarily be d-dimensional. 

3 Multivariable Random Averages 

There are two natural ways that we might extend the results of [15] to Z d actions, depending 
on the choice of random variables corresponding to n. The first, which hews to the product 
structure of Z d , is to write &j = nf=i £i,nn where {Ci,n '■ 1 < i < d; n G N} are independent 
random variables. The other is to simply take independent random variables : n £ Z d }. 
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These different approaches can be characterized as the "plaid" and "speckled" approaches, re- 
spectively, according to the patterns of points that they select in 1? . The relevant distinction, 
for us, will concern the difference set: if we let denote the set of all n with £3 = 1 and 
\n\ < N, then in the speckled case, for any k 7^ there will be (with overwhelming probability) 
not too many ways to write k as the difference of two elements of (and this number of 
representations will in fact be very close to its probabilistic mean); but in the plaid case, there 
will be significantly more representations whenever any of the components of k are zero. For 
this reason, we will first prove the result in the simpler speckled case. 

Remark: We may also consider the "plaid diagonal" pattern obtained by taking £j jn = £1 n for 
all i; however, this does not exhibit any behavior different from the first plaid case, and is more 
difficult to calculate. 

Remark: One might hope that the product theory might help us to prove the maximal inequal- 
ity for the plaid case directly from the one-dimensional result; however, this is not the case in 
L , any more than it is for the Bellow-Losert construction. 

Remark: Our reliance on the "uniformity" of the ways to represent points as elements of 
Sjy ~~ S% gi yes us a natural bound on how sparse a random set we could expect the technique 
to work for: namely, the average number of representations of a point k with \k\ < N should 
tend to infinity as N — > 00, and thus we will need > \/#-6(0, N) ~ N d l 2 . Indeed, we will 
have a result precisely when our random sets have ^S"^ ~ N a with a > d/2. 

3.1 Speckled Random Averages on Z d 

Using the Fourier transform, it is an immediate extension of the one-dimensional theory that 
both the plaid and the speckled sequences are universally L 2 -good. 

Thus we need only prove a weak maximal inequality on L , and (by the positivity of the aver- 
aging operators) this only on dyadically increasing sets. Finally, by the Calderon transference 
principle (Lemma 15. 6p . we can instead prove this maximal theorem for the corresponding con- 
volution operators on Z d . That is, we let : n 6 Z rf } be independent {0, l}-valued random 
variables, such that if 2 j < \n\ < 2^' +1 , then P(^ = 1) = « . Then define 

^ [U) - i otherwise; [6A) 



uf\fi) := \ ^(-)- 2 -^ 2i<|n|<2^ 



otherwise 



Then the random convolution operators / — > f*^j correspond to our random ergodic averages, 

and the Vj are mean variants of the same. The role of the difference set, alluded to previously, 

is expressed in terms of the convolution of the mean measure v^ 1 with its reflection (v) = 

z/j (—v), which we will use later in the argument. Clearly vj 1 ' * vjj (0) = \\%r Z dy but at 
all nonzero points the convolution should be small: 

Lemma 3.1. Let e > 0. With probability 1 in Q,, there exists C U)t < 00 such that 

\\»i U) * t ] h°°(K) <C^-f+^, (3.3) 
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where := Z d \ {0}. Furthermore, 




f\0)\ <C u>e 2^-^. 



(3.4) 



This follows from the more general Lemma I5.1UI and the Borel-Cantelli Lemma. 
We must now prove the following: 

Theorem 3.2. For all 7 < d/2, there is a set of probability linVL such that for each uj in this 
set, there exists C u < 00 such that for every A > 0, 



This immediately implies Theorem 11.11 which gives multivariable averaging methods over sets 
of Banach density zero. 

(We will suppress the superscripts (uj) in the rest of this section, and presume that Uj are 
measures which satisfy the bounds in Lemma 13. II for a constant C.) 

Proof. The method of this proof begins with C alder on- Zygmund theory. We decompose our 
function / in the standard fashion, and imitate the technique of Christ [8] for operators on M. d 
whose kernels are not properly differentiable. For the crucial parts, B, of our decomposition of 
/, we consider the inner products (za- * B, Vj * B) = [Dj * Vj * B, B) and use the extra cancella- 
tion of that convolution to obtain an especially strong L? bound. This is necessary in order to 
compensate for the waste of using an I? bound to prove an L 1 bound. This technique was first 
applied to pointwise ergodic theorems in L 1 by Urban and Zienkiewicz |23j . 

By scaling, we may assume that A ~ 1. We will choose A to be a bit larger than 1, but our 
choice will depend only on d and u), not on / or on j. We apply the standard discrete Calderon- 
Zygmund decomposition at height 1, giving us f = g + b = g + Yl s kbs,k, where \\g\\oo — 1> 
each 6 S) fe is supported on a discrete dyadic cube Q s ^ with side length 2 s , ||6 S) fc||i < 2 ds , and 
J2s k \Qs,k\ — 2 d ||/||i. We can also require each b St k to have mean 0, but we shall not need this. 

Now we will use to our advantage several key properties of and Uj to reduce this problem to 
its final form. First, we have normalized so that with probability 1 in f2, there is a C w such 
that < Cuj for all j, and thus \\g * ^j||oo < C w for all j. Therefore, if sup^- \f * fJ,j(ft)\ > A, 

then supj | Y^ s ,k b s,k * > A - C^. 

We next split b s := ^ fc b St k into two pieces depending on its size. Define 



#{n:sup|/* M r(n)|>A}<^||/||i. 



j 



(3.5) 
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we let &G) := ^i^, 

#{n:sup|^6W* Mj (n)| > 0} < ^#{n : |&C0 * Mj (n)| > 0} 
i s j 

< V|supp^|-#{n:|6(n)|>2( ci -^} 



= C w Y #{™ : 2( ^ 7) '' < IK") I < 2^-^ +1 )} 2^"^' 

i j<i 
i 

< C'JbhKC"] 



because the second-to-last line is a lower sum for b. This is an acceptable bound, so we need 
only consider the contribution of the Bg , which have the property ||.Bs ||oc < 2^~ 7 ^. This 
will be precisely what we need to control the contribution of Uj * Vj(Q). 

Since fij — Uj is simply an appropriately normalized average over all points of magnitude between 
2 J and 2- J+1 , the weak (1,1) bound on the standard maximal function tells us that 

#{n:sup|J] J Bp' ) *( / i j -i. i )(n)| > 1} < #{n : sup |6| * \nj - ^|(n) > 1} 
3 s J 

and we may indeed replace fij with v r 

Finally, if we let Q* k denote the points that are within distance 2 s of the cube Q s ,k: then 
b s ,k * Vj is supported in Q* fc if j < s. We take the exceptional set E to be the union of all Q* k , 

and note that \E\ < Cd\\f\\i- Since E is of acceptable size and supp B^' * Vj C E for all s > j, 
we only need to bound the set 



#{n:sup|^i^)*^(n)| > 1}. 



3 s<j 



This we will do using an 1? bound. 

Remark: Most of the above reductions are standard, with the exception of splitting b, 



s.k 



b^p k + B^' k by height. This is done because vj * &j(0) has no cancellation, and thus we must 



treat separately the contribution of a single delta mass in our inner product. The condition 
ll-B^lloo < 2^~ 7 ^ precisely suffices to balance the trivial bound (|3.4p in these terms. Note that 
there is no equivalent for this in the original problems on M. d , since there the convolutions have 
a singularity rather than a delta mass at the origin; because of this, the analogous endpoint 
theorems require / in the Hardy space H 1 rather than L . 



18 



By Chebyshev's Inequality, 



#{ri: sup | £ Bp> > 1} 



< 



< 



sup 



E B 

s<j 



E 



E B "' * "j 



^ 2 E E 



s<t<j 



2 E E (bP^^r 



(J) 



f J 1 (?) 

First we assume that supp B s C Qjfa and supp B\ C Qj,fc 2 , where each of these is a single 
dyadic cube of size 2 J . Then it is easy to see that \\B^ ||i < 2 dj , and thus by Lemma [3711 



B^Uv^v^B? 



< 11^*^1^00(^)1 



< C7^, e 2^- 3rf / 2 +^||^')|| 1 ||^" ) || 1 + ^, e 2^-^^ =t ||^p)|| 00 ||^- ?) || 1 



We can remove the restriction on the supports since the inner product is whenever the support 
of B^ and the support of B^' are separated by at least 2 J+1 ; thus the double sum over all Qj,ki 
and Qj t k 2 is? up to a fixed constant, a single sum. Therefore we have 



#^:sup|^^)*^.(n)|>l} < E E 2^- d / 2+ ^|| J Bp ) || 1 +^ =4 ||i?? ) || 1 

s<t<j 

|C?')|L 



< EE^ 2(7 " d/2+e)5 + 1 )ii^ J 



and clearly, this is < \\b\\i < C||/||i so long as 7 < d/2 — e. Since e > is arbitrary, we have 
proved Theorem 13.21 □ 



3.2 Plaid Random Averages on 7h d 

In this section, we take independent random variables {£i, n : 1 < i < d;n £ N} with 
1) = rT a . We define 



IJ.) '{n) 



v) >{n) :-- 



2 d(a-i)j Ylf =1 ^(w), V <\n\< 2i+\ m > for alb 
otherwise; 

$Xn) ~ nil n 7 a i 2j < H < 2 J+1 , Hi > for alii 
otherwise. 



(3.8) 
(3.9) 



As mentioned before, our decomposition of the random measure * Uj will be more com- 
plicated because the additional structure removes part of the cancellation at points where some 
of the coordinates are zero. 
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Lemma 3.3. Let e > 0. With probability 1 in CI, there exists C U)t < oo such that 

(u) ~(lu) (u) 
IC{l,...,d} 

where each Xj) is supported on {n : n\ ^ for all i G I}, and for all I 7^ 0, 

IIx^jIIoo < c^2 { - d ~^ +da+e)j ■ (3.10) 

Furthermore, 

|i/j w) * v^\0)\ < C^^-Vi . (3.11) 

Remark: We could aggregate all of the Xji with 1^0 into a single function, and for a < ^ 
the overall £°° bound would be good enough to prove the weak maximal inequality just as we 
proved Theorem 13.21 However, with a more targeted C alder on- Zygmund decomposition, we can 
do better, and obtain the weak maximal inequality for a < 1/2. Thus we can prove an L 1 
pointwise ergodic theorem for plaid sets which are just as sparse as the speckled sets we have 
proved it for. 

Theorem 3.4. For all a < 1/2, there is a set of probability 1 in Q such that for each u in this 
set, there exists C u < 00 such that for every A > 0, 

#{n:sup|/*/4%)| > A} < ^\\f\\t. (3.12) 

Corollary 3.5. For all a < 1/2, there is a set of probability 1 in CI such that every ui in this 
set has the following property: Let {a^ : k G N} be an enumeration of the set {ft : £j jni (u;) = 
1 for all i] with \dk\ increasing. Then for any measure-preserving % d -action T and any f G 
L^^X), the averages 

1 - 

An f{x) :=-Y,foT(a k )(x) 

k=l 

converge a.e. in X. 

Proof. The proof of Theorem 13.41 follows the same lines as that of Theorem 13. 2\ with one sub- 

(i) (i) 

stitution: we will have to do more when we split b s ,k = b s k + B s ^ , so that we can get better 
bounds on the pieces of the inner product which correspond to the various Xj,i- 

Let us jump ahead to that inner product to determine the bounds we will need; again we 
will assume that each of and is supported on a dyadic cube of sidelength 2?. Let 
nj G Z# denote the projection of n onto the coordinates indexed by I, and nj- £ Z^^ 1 the 
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projection onto the other d — #1 coordinates. Then 



U) 



E B[ j \n) E XjAm ~ m ^ 0)Bi j) (rnj,ni] 



mj 



< 



J2\W\n)\-\\xi\\ooJ2\ B ^^^i 



mj 



< II^Hillx^lloo IsupEl^^i^i 



"I rtj 



Just as the delta mass at for the speckled averages made it useful to define Bs so that its 
£°° norm was suitably bounded, in the plaid case we will want to define B^ so that we have 
bounds on the collection of mixed norms that appear in the last line above. As in the 

speckled case, the bounds we can obtain from properties of the support will suffice. 

That is, instead of ({3.7]) . we let Qja denote the dyadic cube with side length 2 J containing 
ft, and define for 1 < jfcl < d 



For 1 = 0, define as before 



b s (n), E \b s {m I ,nj)\>2^ d -# I ^\ 

{m I :(m I ,nf)&Q j:it } 

otherwise; 



(3.13) 



b s (n), \b s {n)\ > 2^ d - da ^ 
otherwise; 



(3.14) 



Note that this bound lacks the factor of 2 eJ ; this exact bound is crucial, since for this term we 
will have no extra cancellation in Xj,l f° r a small. 

Although we would have problems with double-counting if we simply subtracted these pieces 
from the original b s , it is clear that we can define Bs such that 

E \B^(m I ,nf)\<2^ d ~# 1 ^ 

m/:(mj,nf)eQ iiH 



(3.15) 



and 



\B®-b,\< e i^i- 

7C{l,...,d} 



(3.16) 



Thus we have split up b s with regard to the size of the relevant mixed norm on the cubes Qjfi. 
The contribution of the terms bi' is bounded as before. For the others, we will use the geometry 
of the supports and the fact that we can spare an extra 2 tJ . 



If we let Vjj := {(vj,0) : \vi\ < 2- 7+1 }, then we note that supp fij can be covered by < 
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c ^ 2 {d-#i){i-a)j copies of y jh that supp can be covere d by < 2^^ Q ( d - #/ ) +e ^'||6 s ||i copies 
of Vjj, and that \V j: i + V j: i\ < 2* I ^ +2 \ Therefore 

#{n : sup | ^ H' 1 * H (") I > 0} < EE' ( supp ^ + (supp 

i s j s 

< ^^2 #/(i+2) 2" ( ^ Q(d " #/)+e)j ||& s ||i2 (d ~ #/)(1 - a)i 



< 



\b\n 



Therefore we may reduce as before to considering the contribution of the By. The argument is 
again identical to the proof of Theorem 13 . 2 1 until the point where the inner product is calculated. 



For / ^ we have (again under the assumption, easily removed, that B^ and Bf ! are each 
supported on single dyadic cubes of size 2°) that 



,Cj) 



By> * Uj * B 



U) 



IC{l,...,d} 



4 j) *Xj,i,B { t 



(i)\ 



< 



II II,. | 



sup 



n/,n 7 



< \\B\ j \\ l 2^ d ~^ +da+e)j 2 {d ~ a[ - d ~* I)+e)j 

< wb^w^^-^* 1 ^^ 

and so the terms are bounded for e sufficiently small, so long as a < ^. 



□ 



4 Sparse Arithmetic Sets in Z d 

In the following two sections, we seek to adapt a recent construction by M. Christ |10] to a 
higher-dimensional setting. That construction takes polynomial subsets of finite groups and 
transfers them to Z d via Freiman isomorphisms; Weil's inequality on complete character sums 
implies that the uniform probability measure on such a set will be optimally pseudo-random (in 
the sense of its Fourier transform), which suffices to prove a weak (1,1) bound on these averages. 

As in Section [3j we have our choice between extending this one-dimensional result into a product 
set (plaid) construction or a natively ci-dimensional (speckled) version. (One of the means by 
which we can vary the sparse sequence's rate of growth, in fact, is to project a m-dimensional 
set down to Z d , for m a multiple of d.) The proofs will follow closely those in [10]; in the 
speckled case, we will take an isomorphism from to Z d rather than Z, while in the plaid case 
we observe that the relevant Fourier estimates factorize into a product of the estimates for the 
original sequence. 

In addition, we prove an oscillational inequality for these averages, which implies a pointwise 
ergodic theorem; the corresponding result was not included in [10] , 

4.1 A Speckled Construction 

Let c and C be constants larger than 1 and suppose that {p^} is a sequence of primes satisfying 

cpk < Pk+i < Cp k 
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for all k. (Later, we will want pk ~ 2 7fc for some 7 > 0, but this can be achieved just be repeating 
terms as necessary.) 

As mentioned before, we will construct m-dimensional sparse sets, where m = qd for some 
positive integer q; the case q = 1 gives us a "native" (i-dimensional set, while higher values of q 
give us sparser sets. So we fix a positive integer q and choose a sequence of vectors {Sk} C Z, d 
such that 

\a k -i\ + P q k -i < l^fcl ^ C Pk- 
Let [n] denote n mod p k , and define the sequence 

**j = (E^bVE^ 1 ^ ,o<j<Pk (4.1) 

\i=l i=l i=l / 

and the set 

S k = {a k + x k>j :0<j< Pk }. (4.2) 

This is the image of the set {(j, j 2 , . . . , j m ) : < j < pk} C Z™ under a suitable Freiman 
isomorphism. The main reason we have chosen such a set is that it enjoys near-optimal Fourier 
bounds thanks to Weil's theorem on charcter sums: 

Theorem 4.1. (Weil, \2J$ ) Suppose f G Z p [x] is a polynomial andp does not divide the degree, 
m, of f. Then 



E e (/w/p) 



< (m- l)p 1/2 . 



The size requirements on {3k} ensure that the collection {Sk} is disjoint; in fact, all the elements 
of Sk lie in a shell of elements whose lengths are larger than (C + l)p^ but less than Cp q k+l . We 
also have that #Sk = Pk- 

Let S = U^Sfc = {xkj : k G N, < j < pk} ■ As a consequence of the sparseness of the 
individual sets Sk and the lacunary nature of the sequence {pk}, the sparsity of the set S 
is similar to the sequence (n q ,--- ,n q ). We wish to prove the following L 1 pointwise ergodic 
theorem: 

Theorem 4.2. Let {fii}i^fq be an enumeration of {ak + %k,j} the dictionary ordering. Then 
for any measure-preserving Z, d -action T and any f G L l (X), the averages 



i=i 

converge almost everywhere in X . 

As usual, this result will follow from an 1? pointwise result and a weak (1,1) maximal inequality. 
Using the standard transference arguments, it suffices to prove an oscillational inequality and a 
weak maximal inequality for the corresponding convolution operators on Z . 

For the L 2 result, we will consider the averages on Z d , 

1 N 

A N f(v):=-J2f(v + n i)- ( 4 - 3 ) 

i=l 

We must prove the following oscillational inequality, for any given lacunary IcN: 
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Theorem 4.3. For any sequence t\ < ti < ■ • ■ toif/i t n €. I for all n, 

2 



sup \A t f-A tn f\ 

tn-i<t<t n ,tel 



<C||/||J 



(4.4) 



We will prove this (in Section r4.1.2|) by comparing At to a more standard average, such that the 
Fourier norm of the difference is small. When transferred back to the dynamical system, such 
an oscillational inequality will directly imply an 1? pointwise ergodic theorem (see for instance 
Jones, Kaufman, Rosenblatt and Wierdl [13]). 

Then, since the blocks grow exponentially in size, we need only prove the L 1 weak maximal 
inequality for complete blocks: 

Theorem 4.4. Let Sk be defined as above. Then the maximal function 

1 



n # (U fc=1 S fc ) ^^ Sk 



m) 



satisfies a weak- (1,1) inequality; that is, there is a constant C so that for any A > we have 

C 



#{n : Mf(n) > A} < — nJ \\ £1 ^ 



4.1.1 Proof of Theorem 14.41 

Again using a transference argument to reduce to a question about convolution operators on Z d , 
we seek to apply the following theorem from |10j . 



Theorem 4.5. Let G be a discrete group and 7 > 0. Suppose the sequences of functions 
//fe, Uk '• G — > C satisfy the following requirements: 

1. the maximal operator sup/% |/| * \uk\ is of weak type (1,1) on G, 

2. for each fi^, jf (supp (//&)) < C2 kl for some constant C , and 

3. ||/ * (ji k - u k )\\ P{G) < C2- fc ^ 2 ||/||, 2(G) for all f G £ 2 (G). 



Then the maximal operator sup fc \ f * * s of weak type (1,1)- 

As elsewhere, we let e(6) denote e 2md . Henceforward we will suppress our k subscripts and 
identify with its own dual group. 

If / is a function on Z™, then the Fourier transform of / is 



ft ■ £ 
V 



We then have the inequalities 

Tg 



< 



p 1 ' 



\g\\ faa , and 



V 



m/2 



f 



(•■> 



\9\\f 



£2 \\g\\tp 



(4.5) 
(4.6) 
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Finally, we will again make use of Weil's theorem for complete character sums (Theorem 14. ip . 
Let m = dq. Our goal will be to produce measures // and v' on Z™ that satisfy the analogues 
of the conditions in Theorem 14. 51 We will then apply a linear operator to move from Z™ to Z m . 
A second operator will then transfer these measures to Z '. 



Proof of Theorem \4-4\ We begin by defining a probability measure on 7L 

p-i 



1 



F 3=0 



Note that 



# (supp (//)) = p. 



Thus y! satisfies the second requirement of Theorem 14.51 if p^ ~ 2 fc7 . 
We then define a second measure: 



1 = — y 



(4.7) 



(4.8) 



(4.9) 



This will lead to a measure v which satisfies the first requirement. 

We are left with the third condition. By our inequality (14. 6h . we have that 



1/ * <y - v ')\ t i < 



{i' - V 



It remains to prove an appropriate bound on 
of fi' is 



H' - v 1 



, using Theorem l4.1i The Fourier transform 



\9) = V(n)e 



n 



1 P_1 

6 u,j 2 ,-,3 m )^ e 



n 



p-i 



3=0 



jOl+j 2 02 + ...+f 
p 



By the theorem, we then must have (for all 9^0) 



And because 



1 if 9 = 

otherwise, 



n-9 
P 
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and 

/?(0) = 1, 

we have, for 9^0 that 

while for 9 = 0, the difference is 0. 

Having found suitable measures \J and v' on Z™, we now construct measures on Z d with the 
same Fourier properties. We will first create a "smooth cutoff" version on Z^, then transfer 
this to Z d ; since the result majorizes our desired measures [i and v, we may thus obtain the 
weak inequality for them. 



< 



m 



n 



1/2 



(4.10) 



First we will build a smoothing function, (p. We identify Z^ with [— p, 2p — l] m and Z3 P with 
[— p, 2p — 1]. We define the function ip : — > M so that 



<p(n) = < 



We then define (f> : Zf p 



1 if n 6 0,p- 1], 

if n € 

affine otherwise. 



-p-i 



U l),2jp — l] , and 



as a product of 92's: 
m 

Jy>(ni). 



n 



i=i 



We will also require the functions t p : 



7m 1 ™ 



™ and r 3p 



7 m _ v 7m given by 



J 3p' 



t p0?) = ([ x i]p> Mp, [x m ]p) , and 

T"3p(#) = ([Xlhp, [X2]3p, [Xmhp) 

We can now transfer from measures on Z™ to "smoothly cut-off" measures on Z m . Define 

ri : f>°(Z™) -> r°(Z m ) by 

Tl(/)(J) = l[-p,2p-l]™(J)0(T"3p(J))/(Tp(J))- 

Let £ T m and write 9 = £/3p + 77 with £ E Z m and |r/j| < C/p for all 1 < i < m. The Fourier 
transform of Ti (/) is 

= £ r 1 (/)(J)e 



E <^)/(^)) e 

Je[-p,2 P -i] m 



J ' V I e 



LA 

3p 



Changing perspective, we may consider this as the Fourier transform on the group Z?^ of the 
product of / and = <j)(x)e (x ■ fj). By the inequality f)4.5|) . then, we have 



ri(/)(fl) 



1 

< — 

L°° (T m ) p m 
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We plan to set 



fj," = Ti(fi') and 

1/' = ^ (!/'). 



The inequalities (|4.8j) and (|4.10p will insure that the requirements of Theorem 14.51 are met for 
//' and v" on Z m , so long as < Cp m . Now 



4' 



-E 



E ^(/) e (/ , ^) e (^-) 



Je[-p,2p-i] m i=1 



n e 



X] ¥>(ii) e (fl»fc) e 
ji€[-p,2p-l] 



It therefore suffices to show 



E 



je[-p,2 P -i] V P 



(4.11) 



For £ = 0, we have the trivial bound 



je[-p,2p-i] 



< £ k(i)| < 3p. 

ie[-p,2p-i] 



For £ 7^ 0, we will pursue the required bound using summation by parts. Letting 
<p(j)e(jTj), extended periodically, and noting that A<$(n) = $(n + 1) — ^(n), we have 



E •(*(£)-(*) E<| 

je[-p,2 P -i] v F/ \j=-P v ^ 



2p-l 



£ A$(j) £ e 



2p-l 



3p 



3=~P 
I 3 



n=— p 



3=~P 



e 1*1-1 



since 3>(2p) = 
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Further, we have 



2p-l P I 

c 1 3p 



E A<3> w- 



1 1 



j=-p 



- 1 



i 

3p 



e l±- -1 



3p 



e ±- -1 



3p 



-1 /2p-l 

E A$ ^> 

. /2p-l 

E A$ ^> 

1 7 i=-p 



(j + l)g 
(j + 1)£ 



2p-l 



E A *o> I 



3p 



2p-l 



3=-P 



m 

3p 



We now apply summation by parts a second time: 



2p-l 

E A$ (^> 

i=-p 



3p 



2p-l 



A$(2p) E e 



u=-p 



2 P -i e 

E A2< ^)- 



3p 

(iM 

3p 



2p-l j 

E a2< ^) E e 

j=-p n=-p 



(n)£ 
3p 



j=-p 



1 

3p/ 

t \ X -1 2P-1 



(j + i)e 

3p 



We therefore have that 



ie[-p,2 P -i] 



3p 



< 



-2 2p-l 

i ) e A2 *o> 

3=-p 



(J + 2)g 
3p 



1 



2p-l 

E l A2 *c*)|- 



3=~P 



As a consequence, 



E 

ez 3p 



je[- P ,2p-i] V 



< ll A2 *(;)IU E 



^0 



We will estimate each factor separately, making use of the inequality \e(6) — 1| < |27T0|; a 
consequence of the fact that a chord is necessarily shorter than the arc it subtends. For the 
second factor, this gives us 



E 



1 



<VEt^ = ^ 2 - 



28 



For the second, we have 



2p-l 2p-l 

|A 2 $(j)| = J2 + 2 ) - + x ) + 

j'=-p i=-p 

2p-l 

= \<p(j + 2)e((j + 2)7?) - 2ip(j + l)e((j + l) v ) + ^OXjt/)! 
i=-p 

2p-l 

= \<fU + 2 M 2? ?) - Mj + l)e(^) + <fU)\ ■ ( 4 - 12 ) 
i=-p 



For j G [— p 



-p-i 



' 2 



3] U [|(p — l),2p — 1] this last sum is 0. For j G [0,p — 3], we have 



¥>(j + 2) = y(i + 1) = (p{j) = 1, hence 

|y?(j + 2)e(2r/) - 2^(j + l)e(r/) + = |e(2r/) - 2e(r?) + 1| 

= |e(r/) - 1| 2 < (27T77) 2 

Since fy < 1/p, this is less than 40/p 2 . The sum over this subinterval is therefore less than 
^ < 40/p. 

For j G [ ~^~ — 1, —1], we have 
\<p(j + 2)e(2r/) - 2<^(j + l)efa) + v0")l 



(^(i) + ^r)e(2r?) - 2(p(j) + -^-)e(r ] ) + tp(j) 



v-v 



< \v>0 — + 

p z 



-e(2r?) 



< 



p — 1 p — 1 

,.,,40 4 . . . . 
p z p — 1 
40 + 8vr 
p 2 



p — 1 ' 



A similar calculation gives the same bound over the interval [p — 1, |(p — 1) — 2]. Thus we also 
have a bound of C/p on these sums. 

This leaves us with only the cases j = ~ p 2 ~ 1 — 2 and j = |(p — 1) — 1. For, j = ~ p 2 ~ 1 — 2, though, 
we have that <p(j) = f{j + 1) = 0; this leaves us with 



p — 1 



The same bound holds for the case j = |(p — 1) — 1. 
We therefore have that 

2p-l 



c 



E l A2 ^)|<-- 



3=-P 



This completes our estimate (14. lip . 



Remark: In the case q = 1 (recall that m = dq), we are actually finished, since our desired 
measure p is majorized by p". We wish to generalize, however, to show that we can achieve any 
desired polynomial rate of sparsity. Therefore, we shall introduce a second operator to "project 
down" from 7L m to Z d . 
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Define the Freiman isomorphism F : 



2q 



Hi) = Y.p 1 ' 1 ^ E p l ~ [q+1) ^ 



7 / by 

n. 



dq 



i=l 



i=g+l 



i=g(d-l)+l 



(4.13) 



We note that F maps [0,p — l] m bijectively to [0,p 9 — l] d , and that for any k > and any 



ji G [-p, 2p - 1] we have Y}t=kq+iP l l 3i G [-QP q , QP q ] '■> hence, for j G [— p, 2p - l] m , we have 

F(3)£[-qp q ,qp q ] d . 



Define the operator T 2 by 



r 2 / = £ /(/). 

{r-F(J)=n} 



Now consider the operator T = T 2 Ti : £ 1 (Z^ 1 ) — > ^ 1 (Z d ). As before, we define our intermediate 
measures fjf" and u'", 



//' = T 2 (//'), and 

v = r 2 (f ), 

and we want to ensure that ||r/|| L oo( T d) < C||/||#»(zm) so that we may apply Theorem 14.5 
Now, 

ff(9) = £ r 2 rv(n) e 



n 



E 



e(0-n) £ r!/(j) 

{j:F(j)=n} 

q 
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(/</ 



E r i/(^) e *i zy _1 * + ^ E + - + °d E 



Je[-p,2p-i] m \ * =1 i=g+l 

fV (OuOip, m 1 " 1 , 2 , e q , e qP , ^p 9 " 1 ) . 



i=q(d-l) + l 



So 



r 2 ri/ 



< 



ri/ 



, and by our previous bound we have that jj!" = r 2 Ti// and u'" 
r 2 rii/' obey the difference requirement from Theorem 14.51 Further, we have that 



(4.14) 



where 



v i=l i=l i=l 

.III 



so that a weak (1,1) inequality for ji'" implies a weak (1,1) inequality for fx. We now wish to 
show that jj!" and v"' satisfy the other two requirements of Theorem 14.51 



We first observe that 

# (SUPP (Tl//)) = # (supp (l[_p )2 p_l]m(/> O T 3p // OT p )) < Cp. 
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Recalling our requirements on our original sequence of primes, and noting that # (supp (l^/)) < 
# (supp (/)), our condition on the support of (J,'" is satisfied. 

For the final requirement, we first note that # (supp Tiu 1 ) < (3p) m ; the support of u" 1 , then, 
likewise has measure less than Cp m . The supremum of u' is l/p m . Thus we have 



< Cp m sup 

n 



< C3 r 



{r-F(l)=n} 



{r-F(j)=n} 

je[o, P -i] m 



< C3 m . 

We then have a weak-(l,l) bound for sup fc |/| * |^"|. 
Restoring the subscripts, define 

Hk{n) = Hk{k -a k ), and 
v k (n) = v k (k -a k ). 



By Theorem l4,5l sup fc \ f * fi k \ obeys a weak-(l,l) inequality. Together with the inequality (|4.14l) 
this completes the proof. 

□ 



4.1.2 Proof of Theorem HT31 

As in Section 14.1.11 we will use the exceptionally good Fourier bounds of certain measures on 
Z™, and transfer these to measures on Z d using operators T p . However, in this case we cannot 
allow ourselves to use a smooth cutoff function, because we need to wind up with the actual 
averages, not simply weighted averages which majorize them. This introduces a logarithmic 
factor which would have been fatal to the weak L 1 maximal inequality, but which is harmless 
here. 



To begin, given the sequence rii = a k + x k j (where of course < j < p k ) in the dictionary 
ordering (and thus given indices k(i) and for each i £ N), we define the measures on 7L d 



1 - 



N 
i=i 



vn = — 
N 



■(M\ k(N)-l 

^P-x (M (7V) - i] d ) + E pl~ m * {M - ) • ( 4 - 16 ) 

Pk(N) k=1 V 



(in simply corresponds to an average over our sparse sequence, while vn is a weighted average 
over the ci-dimensional blocks which our sequence "lives on". We will see that /ijv and are 
made from the images under the operators T p of the measures in (|4.7p and (|4.9p on Z^!, and 
thus we may count on their Fourier transforms to be very close to one another. 
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Recalling the Freiman isomorphisms F p : Z m — > Z d from (|4.13p . and identifying Z™ with 
[0,p - l] m C Z m , define T p : -> ^(Z d ) by 

r p(/)(0 = f{ F p l {n))X[o, P <i-iy{n)- 

{F~ l {v) here denotes, for v G [0,p 9 — the unique «J G Z m such that F p (w) = ??.) This is 
the same as the final operator from the last section, except that we do not use a smooth cutoff 
function. 



By the same argument there, we may conclude that 

ll r p/llz,°°(T<i) < Cp~ m \\$\\e-(Z™)\\f\\t°°{V?), 

where now ip(n) = e(n ■ if) for some ff E T m with \rji\ < 1/p for i = 1, . . . , m. But trivially, 
|*i(Z"0 < (Cplogp) m , so ||r p /|| LO c( T( i) < C(logp) T 



loo rwvn \ 



For all of the pieces of hn and z/jy with fe < k(N), we may of course apply Weil's theorem 
on complete character sums (Theorem 14. 1 [> . However, the last component corresponds to an 
incomplete character sum, for which we instead apply Weyl's Inequality [25] to improve on the 
trivial bound; we find overall that 

||/tjv - %||x,oo( T d) < CN~ e 

for some e > 0. Thus along any lacunary / C N, we see that 

T,\\^-^*f\\p(z^<c\\f\\% m . 

tei 

Therefore an oscillation inequality for convolution with the ut would imply (|4.4p . 
As in |18j . we introduce simple Fourier multiplier operators Vt on £ 2 (7j d ), defined by 

V t f(a) = { /(d °' H^P^, 
I otherwise 

Now we can pass from an oscillational inequality for the Ut to that for the Vt, because 
Lemma 4.6. 



£ih*/-^/iil<tfii/ii2- 



tei 

Proof. This follows from the assertion 



sup V \h 

\ teI 



(a)-V t (a)\ < C. 



Fix a G T d , and take K such that \a\ ~ Vk ■ Then for t with k(t) < K, a simple calculation 
shows that 



\h(a) ~ V t (a)\ = \i> t {a) - 1| < |a|||Vi> 4 | 



< 

OO r^j 
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Note that the calculation of HVz^Hoo is the one and only place we use the growth assumption 
on afc. 



For t with k(t) > K, 



\u t (a) - V t (a)\ = \0 t (a)\ < \ J] — ^ < 

i=l 



PK 
Pk(t) 



This calculation uses the fact that vt(a) can be expressed as a weighted sum of averages over 
blocks which grow exponentially in size. Thus 



fe\A t (a) - V t (a)\) < E 
Vte/ / t£l:k(t)<K 



Pk(t) 

PK 



+ o(i)+ E 

tei-.k(t)>K 



PK 

Pk{t) 



since the p k are an exponentially increasing sequence and I is lacunary (thus the number of t 
associated to any k is uniformly bounded). □ 



We now need only to prove that 



sup \V,f-V,J\ 

tn-l<t<t n ,t£l 



<c\\f\\l 



Now we note that for t n _i < t < t n , V t f — V tn f = V t (V tn _ 1 f — V tn f). Also, Lemma I4T61 lets us 
derive a £ 2 maximal theorem for the Vt from the one for At] this follows from our i 1 maximal 
inequality. Therefore 



sup \V t f - V t J\ 

tn-l<t<t n ,tel 



sup \Vt(V tn _J-VtJ)\ 

t n <t<t n +\ ,t£l 



< c\\v tn _j-v t j\ 



And now we see that 



E W V * n -J - VtnfWl < ( ™p E \Vt n -M ~ VtM)\] E 

n J n 



(4.17) 



since the functions Vt n _ 1 — Vj n have disjoint supports. This concludes the proof of Theorem 14.3 



4.2 The Product Construction 

Suppose that m = qd and S±, S2, S r are subsets of Z d , respectively, as constructed above. 
That is, Si = U^^S 1 ^, where 



Si, k = { a hk + Epff^W' E^?^ + 'W' EX*^' 



t-l<-(d-l)q+U 



lPi,k 



: < ji < p ik 



t=l 



t=l 



and 1 < i < r. The sequences {Si jk }^ =1 and {pi,k}' k Li are not necessarily distinct with respect 
to i. 

Let 5 = ni=i Sii an d ^ Bt denote the ball of radius t in Z rd . 
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Proposition 4.7. Let S be defined as above. Then the maximal function 

M/(n) = sup 1 + 

satisfies a weak- (1,1) inequality; that is, there is a constant C so that for any A > we have 

#{n:Mf(n)>\}<j\\f\\ £1(ZTd) . 

As in the previous section, we will require two inequalities relating convolutions, products, and 
norms of Fourier transforms on these finite groups. With the Fourier transform on Yli=i 
defined by 

where Hi and £j are elements of Z™, we have the natural analogues of the inequalities (|4.5|) and 
(Oil : 



ffj 



< 



\\f*gy<\\fy\\9\ 



c 1 



\g\\gx , and 



(4.18) 
(4.19) 



Proof. The proof proceeds in very much the same way as that of Theorem 14.41 
Once again suppressing our k subscripts, let p\,P2, ---iPr be odd primes, each larger than m > 1. 
Define 



1 



rc=iPi 



Ul>32,—Jr) 

enLi[o,Pi-i] 



noting that 



We also define 



# (supp = JJpi- 



n* 



As before, we will first seek an appropriate bound on ^ — v' r 
As our products remain finite abelian groups, we have that the Fourier transform of jj! r may be 
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written 



1 



/ n x • »i n 2 • f/2 , . Br" y r 
n)e 1 h ... H 



Pi 



P2 



Pr 



E E « ( 



1 I'' v . J (ii,...j7;...;>,...jj 
lli=1 ^ (iU2,..jV) nenLi^ 
eni=i[o,Pi-i] 

1 Pi^ 1 P^-l /„-,/j. , „-2 



ni ■ X \ ( n 2 -02 



n. 



Pi 



P2 



Pr 



UUlPi 



yv ^ e f Jl ^ >1 + Jl^ 2 + ■•• + e ( 3rQ(r-l)m+l + Jr^(r-l)m+2 + ••• + j™6n 



31=0 j r =0 



Pi 



Pr 



7 1 ^— J 1 3i0(i-l)m+l + J?%-l)m+2 + ••• + jf 



i=l \ Pl A=0 



Pi 



where rTj denotes those entries of n drawn from Z Pi and 0j = (#(j_i) m +i, #(j-i) m +2j 
By Theorem 14.11 we then must have 



/// 



< 



(m - l) r 

rr V 2 "' 
Ui=iPi 



We have that 



and that fJ,' r (0) = 1. Therefore, 



4(0) 



1 if = 
otherwise, 



< 



(m-iy 



(4.20) 



T-rr 1/2 ' 

We now embark on the construction of Ti and r 2 . 

Identifying Zgl. with [— Pi,2pi — l] rn and Z 3pi with [— Pi,2pi — 1]. We define the functions 

9?i : Z 3pi ->■ R by 



ipi(n) = < 



1 if n G 

if n G 

affine otherwise. 



0,Pi - 1], 

2 



U [|(pi - l),2pi - 1] , and 



With Hi = (n i; i,nj i2 , ■■■,n i>m ), we then define </> : n[=i %T Pi K by 

r m 

= n n & ( n *> fc ) ■ 

i=l k=l 

Define r p : Z™ -> n[ =1 and r 3p : Z™ -> TlLi by 

Tp(^) = ([&l]pu felpu •■•) [fem]pi) [^m+l]p2i [^m+2]p2) •■■■> [^2m]p27 ••■> [^(r— l)m+l]j>r ' •••■> [^rm]p r ) ; an d 
T"3p(fc) = ([^l]3pi> [^2]3pi) ■••) [k m ]z Pl , [k m+ i]s p2 , [k m+ 2]3p 2 : —■> [^2m]3p 2 ; "J [k(r-l)m+lhp r i ••■> [krm\Zp r ) 

We now define T±: 

r i(/) = 1 nr=i[-pi>2 Pi -i]»'0 r 3p/ ° t p . 
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Suppose 9 G T rm and let & e Z m so that 

9 = (|i/3pi +%,6/3p2 + l?2, -iCr/Spr + Vr 



with Ir/j^l < C/pi for all k and i 

If / : n[=i ~~ ^ ^> then the Fourier transform of T\f would be 

fTf(9)= £ TifiMho) 

^2 <t>{j)f T p(i)e(ji • ??i + ••• + jr • ( ji • ^- + ... + j 



jenr=i[-Pi.2 Pi -i] 

Letting 



3p r 



(4.21) 



= 0(f)e(xi • fji + ... + x r ■ ff r ), 



we find that the expression (|4.2ip is the Fourier transform of the product of / and tp on Yll=i ^3pi ■ 
By ()4.18p . we have that 



ri(/)(f) 



< 



In this case, we have that 



E 



3 ?! 



X] Hl) e (ji ■ Vi + ••• + Jr • ?7r)e [ ^r-^ + ••• + 



E 



3pi 

3'erK=i[-Pi. 2 Pi- 1 ] m v 

/ r m \ 

( n n ) ■ ^ + ••• + > ■ ? ^) e 

Jenr=i[-p<>2pi-i]- 



vi=i fe=i 



3p r 

3pi 



+ ... + 



jr ' £r 
3p r 



nn e 

i=i fc=i£ ijfc ez 3 p. 



^2 v(ji,k)e(ji,kVi,k)e 



ii,fee[-pi,2pi-i] 



3pi 



If 



E 



Ji,feS[-pi,2pi-l] 



ji.kC 



3pi 



then we are done- but this is precisely the inequality ()4.11|) . 
Suppose m = g<i. Define F : 7L rm — > TL Td by 



dq 



F(j)= [Y.Pl^jhk,.-, £ 

fc=g(d-l)+l 



fe-(g(d-l)+l) 



Ji 



k-(q(d-l)+l) 



Jr,k 



,k=l 



k=l 



k=q(d-X)+l 



Once again .F acts as a bijection on our sets of interest; here we have that F is a bijection from 
111=1% Pi - !] m to IlI=J .Pr - !]■ We again define T 2 by 

r 2 /(n) = £ /(J). 

{j:F(j)=n} 
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As in section \37l\ we have that 



r 2 rv 



< 



ri/ 



Setting fjf" = T2Tifi' r and u'" = r^Tii^., we then have that fj,'" and v"' meet the difference 
requirement of Theorem 14.51 
We also have that 

r 

# (supp (r 2 ri/4)) < # (supp (r^)) < c]\ Pi , 

i=l 

and that 



K"IU < #( su pp {v'")) ||r 2 riz^.|| €00 

1 nr=i[-p I -2 Pl -i]-</ , °T3p 



i=i 

< c. 



p^sup 



{j:F(j)=ri} 



ni=iP m 



As in the previous section, then, all three requirements will be satisfied; noting that 

^(H)> r ^—^2dj(n), (4.22) 

where 

~ s = { (z^iW, E^bf - 1)q+t U, k , t^ d - 1)q+ %) = o < * < 1 < i < 4 

I \t=l t=l t=l ) J 

it remains only to reintroduce our subscripts and to shift by ^. 
Defining and by 

/i fe (n) = ^' fc (n) 
ffe(n) = ^'' fc (n), 

we have that sup fc \ f * obeys a weak-(l,l) inequality. With this and the inequality (|4.22p . 
the proof is complete. 

□ 

We have that a product of sum sets of the type constructed in Section (|3.ip remains a good sum 
set. 



5 Sparse Sequences and Actions of Virtually Nilpotent Groups 

We begin with a few necessary definitions. 

Definition Let G be an infinite finitely generated group with identity e, and A = {e, a\, . . . , a n } C 
G be a finite symmetric generating set containing e. Let denote the elements of G expressible 
as words of length N in A, and let A := {e}. Then p A (g, h) := min{N : gh~ l £ A^} defines a 
metric on G. 
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Select a symmetric set of generators A. Then A is the ball of radius N in the word metric on 
G. Classical results by Wolf Bass pQ, Milnor and Gromov [T2] amount to the following: 
G is virtually nilpotent (contains a nilpotent subgroup of finite index) if and only if there exists 
rfeN and < c < C < oo such that for all N G Z+, 

cN d < #A N < CN d (5.1) 

(here C, but not d, depends on the choice of A). Thus we say that G has polynomial growth of 
degree d. 



Pansu [T7] improved this result further: 

Theorem 5.1. Let A be a symmetric set of generators for the virtually nilpotent group G. Then 
there is an integer d so that the sequence 



#A 



N 



N d 

converges. 

Note that in particular this implies, for all g £ G, 

]im «A^) 

A^oo #A W V ' 

Since the important matters in the proofs that follow do not depend on our choice of A, we will 
henceforward suppress it in superscripts. 



5.1 Block Averages for Virtually Nilpotent Groups 

In the proof below, we will again seek to apply Tempelman's Theorem to a shifted sequence of 
sets whose volume increases in a lacunary fashion. Instead of rectangular prisms, however, we 
will take as our shifted sets elements of the family {A^}. 

We immediately note that for any particular radius, the volume of the corresponding ball is 
finite, and that for any m > n, A™ C A m . Further, the family of balls of radius N, {A JV }Ar > o, 
themselves satisfy the Folner condition (see [6]). 

Choose the sequence 4 as in section 12.21 an d a sequence of elements £ G, with 

1. p(a k+ i,e) > p(a k ,e) +4, and 

2. 4 > Cp(a fc _i,e). 

Defining B k = A^ k , we let S = U k> oa k B k . 
Suppose 

S(k,r) = (U i<k+1 aiBi) Ua k+1 A r , 

where < r < l k . 



Proposition 5.2. The sequence S{k,r), with k > 1 and r > 0, is a pointwise L l -good sequence 
of sets for any free G-action. 
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Remark: For the proposition as it is written, merely having the upper and lower bound (as in 
15. ip would suffice; however, Theorem 15,11 gives us more. Specifically, we have that an average 
over any increasing sequence of sets of which our Folner sequence is a subsequence must also 
converge. For example, we have that the averages taken element-by-element also converge, so 
long as we successively fill each set in our constructed sequence. A similar result holds in the 
Z rf case, as an immediate corollary. 

Note that if we consider only r = 0, we would have an L 1 -good sequence corresponding to the 
sequence consisting only of whole blocks in the original block sequence construction. 

Proof. As in Section [2j we need only verify the difference requirement. 
Letting = Uj<fc + iaj.Bj and R = afc + iA r , we have 

# (S(k, r)S-\k, r)) < # (S^T 1 ) + # (S fc iT x ) + # {RS^ 1 ) + # (RR^ 1 ) . (5.3) 
We note that for any ball A^ in G, we have 

A^ (A^CA^. 

Hence, due to the polynomial growth of G, the size of the difference of any ball with itself is 
bounded by the size of the original ball: 

# (A w (A^)" 1 ) < #A 2N < C (2N) d < 2 d #A N . 

Thus the last term in (|5.3p is less than C JfR for some constant C. 

For the first term, we again consider a decomposition: 

# (S k S^) < # (ofclMS- V) + # Ma^i) + # 0&-iUfc V) + # {s*-is£i) ■ 

Again, we immediately have that the first term is less than C#Sk- But Sk~i C B^, by our 
condition on the So each of the three other terms is also less than C^S^. 

This leaves only the second and third terms of (|5.3p . We note that, by our conditions on l^, 
Sk Q A rffe for some constant c. For the second term, then, we have 

# (SkR- 1 ) < # (A^iT 1 ) < C# (S(k,r)S-\k,r)) . 
In a similar way one may show that the third term is less than C# (S(k,r)S~ 1 (k,r)Y 

□ 

5.2 Random Averages for Measure-Preserving Group Actions 

Let Q be a probability space, let < a < d, and let : g £ G} be independent 

{0, l}-valued random variables on £1 with P(£ 9 = 1) = p(g,e)~ a . Note that by Theorem 15.11 
and the Strong Law of Large Numbers, there exists C depending on G, A and a such that 
P ( N a ~ d Yl g eA N £s — * C ) = 1- We restrict ourselves to this set J7i of probability 1. 
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Definition For a measure-preserving group action (X, F, m, {T g }) and/ € L 1 (X), define the 
average 



and the random average 



g ek N 



Krengel proves several theorems about measure-preserving group actions and other additive 
processes in Section 6.4 of [14]. We will apply Theorems 4.1, 4.2, and 4.4 from that section to 
our particular case: 

Theorem 5.3. Let G have polynomial growth of degree d, and A be a finite symmetric generating 
set. Then for every measure-preserving group action (X,J-,m,{T g }) and 1 < p < oo, A^f 
converges in LP and a.e. for every f £ L p (X,m). 

Let G have polynomial growth of degree d, and A be a finite symmetric generating set. Then 
we have a weak-type maximal inequality on G itself, 

1 C 

#{g£ G:sup|p*^^l A Ar| > A} < -|M|i for all ip £ £ l (G). (5.4) 
We may now state our main results: 

Theorem 5.4. Let G be a finitely generated group with polynomial growth of degree d, and A a 
finite symmetric generating set, and < a < d. Then there exists ^Cfi with P(^2) — 1 such 
that for each uj £ O2, A^t ■ f converges in L 2 and a.e. for every measure-preserving group action 
(X,J-,m,{T g }) and every f E L 2 (X,m). 

Theorem 5.5. Let G be a finitely generated group with polynomial growth of degree d, and A 
a finite symmetric generating set, and < a < d/2. Then there exists ^Cli with ^(^3) = 1 
such that for each uj G f^, A^ f converges in L 1 and a.e. for every measure-preserving group 
action (X, T, m, {T g }) and every f E L l (X,m). 

5.2.1 Proof of Theorem IB^4l 

The analogue of Theorem 15.41 was proved by Bourgain [5] using the theory of exponential sums, 
and this technique extends to the natural analogues in Z d . However, on virtually nilpotent 
groups the Fourier transform is not so easy to work with, and so we will prove the L 2 theorem 
using the TT* method and a lemma from combinatorics. 

It will suffice to prove convergence of the A^ f along a suitable subsequence. Indeed, fix 
an increasing sequence {Nj} C N such that ^ +1 — > 1. Then for any / > and Nj < N < Nj+i, 

^)"°<»/<^/<(^f)"°^' +1 /. (5.5) 

Then under the assumptions of Theorem 15.41 it suffices to prove that A$f converges in L 2 and 
a.e. for all / £ L 2 (X). We may assume that {Nj} is superpolynomial; i.e. 2V,- 3> j for every 
C G N. 
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We will compare these random averages to their expected value, which is a weighted average of 
the standard ergodic averages. Define 

N 

a N f(x) := E w A ( ^f(x) = N a ~ d £ p(g,e)- a f{T g x) = '£a n , N A n f( k x), 

g£A N n=0 



where a n n > 0, > a n n = 1 for all N, and lim a n n = for all n. Since A n f converges in I? 

' JV->oo 

n=0 

and a.e. by Theorem Kl, clearly a^f converges in L 2 and well. 

We will prove Theorem l5.4l bv showing that there exists a set C f&i with ¥(^2) = 1 such that 
for every wGfi2, 

\\sup\A { £ ] f -a N .f\\\ 2 -> as fc -> 00 for all/ G £ 2 (X), (5.6) 

j>k 3 

which immediately implies A^'f — <?Njf ~ > in L? and a.e. 

As in [1] and other papers, we hope to transfer the corresponding maximal inequality from 
the group algebra £ P (G). This Calderon transference principle is practically identical to the case 
G = Z, but it is necessary to prove it in this general setting. 

Lemma 5.6. Let G be a group with polynomial growth, and (X, J 7 , m, {T g }) be a measure- 
preserving group action; let {a g j} C C such that Y2geG\ a 9,j\ < 00 for all j. Set Ajf = 
J2 g eG a 9J T gf and Mj = E ff eG a fl,A- 

For any 1 < p < 00, if \\supj \tp * (J>j\\\p < Co ll^llp f or a ^ £ t p {G), then || sup^- |Aj/||| p < 
Co||/||p /or a/// G 

if instead || sup^ |V ; *A t j| ||p,oo < CollV'llp for allip £ £ P (G), then || sup,- || PjO0 < Co||/||p for all f £ 
D>(X). 

Proof. We first consider the strong maximal inequality. It is enough to show that || sup 1<J< j \ Ajf\ \\ p < 
Co||/||p for all / G L P {X), for each fixed J G N. We may further assume that the supports of 
the Hj are finite, and let 6 := Uj=i SU PP Afj- Take a finite symmetric set A that generates G, 
and the sets A^ defined in Section 15.21 Fix x G X and a large finite K G N, and define <£> on G 

y ^ yy > \ otherwise. 
Then Ajf(T g x) = <p* /j,j(g~ l ) for all g G A^ and all j < J. This completes the proof for p = 00; 
for p < 00, 

V sup |^-/(T 9 x)| p = V sup |^*^(p _1 )|p < || sup \(p*Hj\E 

= c p £ |/(T 5 x)r. 

Integrating over x £ X, 



^1^-/111? <CS«^±S||/|| ? ; 
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we let K — > oo and note that (|5.2p implies (with Co independent of J) 

|| sup |^/||| P <C ||/|| P . 
i<j<J 

For the weak inequality, we similarly derive 

\v#{g£A K : sup \Ajf(T g x)\ > A} < (%\\<p\\* 
i<j<J 

and integrate this in the same manner. □ 

Proof of Theorem 15.41 (Continued): We will transfer this problem to £ 2 {G) using Lemma 
15.61 Let r] g {ijj) = igioj) — p(g,e)~ a ; these are independent mean Bernoulli variables. Define for 
each j the random measures 

u^(a) - i ^TV")' 9eANj (5 7) 



Then for ip £ £ P {G), we have the random averages ip * Uj'(h) = N" Yl ge A N j ^gi^Pi^d 1 )> 

which correspond to the operators A^? —un- in the sense above. Theorem 15.41 therefore reduces 
to verifying that with probability 1 in f2, there is a sequence Ck,ui —> such that 



||sup|^*i/j w) ||| 2 <Cfc^||#j for all^S £ 2 (G). (5.8) 

j>k 

Since || supj> fc \ip * v^\\\l < || J2j>k * I Hi = Ej>fc * v j^\\h [t clearly suffices to prove 
that 

00 

Ell Mil 2 ^ 
\Wj \\op < OO, 

i=i 

where || • \\ op is the norm of the convolution operator on £ 2 {G). 

Since in this context we do not have the Fourier transform to help us, we will use a differ- 
ent Hilbert space technique: the TT* method from harmonic analysis. 

For any operator A on the Hilbert space £ (G), the operator norm \\A\\ = \\A*A\\V 2 = IKAM^II 1 ^; 
for the convolution operator Af = fj, * f, the adjoint operator is simply A*f = p, * f for 
p,(g) := /x((7 _1 ) (G is discrete, thus unimodular). Thus we have the trivial bound ||-A|| op < 
||(/2 * \\op 2AI < IKA * ^) M ||£i 2M ) and thus any cancellation in the convolution products will 
make itself known in the original operator norm. (Here and in what follows, we use [i n to denote 
the n-fold convolution product //*//*•••* /x.) 

The cancellation in this convolution product can be described in terms of additive combina- 
torics on G: if we take a random subset E C with size 3> (ftA 1 ^) 1 / 2 ^ 1 , then the number 
of ways to write any element of A 2MN as a product gig^ 1 ■ ■ ■ S2M-192M with all g$ € E should 
mostly be quite close to the "average" number of ways to do so. The quantitative version of 
this is as follows: 
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Lemma 5.7. Let G be a group and E a finite subset. Let {X g } g ^E be independent random 
variables with \X g \ < 1 and EX g = 0. Assume that S 96 #Var X g — 1- Let X be the random 

i l (G) function J2 9& e X 9 6 9- Then E\\(X * X) M \\% < C M {Y, g& E Var x g) 2M > where C M depends 
only on M . 

Proof. 



e{\\(x*x) m \\%) 



E E 

96G 



g 1 h 1 1 ...g M hJ = g 



X n Xh • • • X n Xh 



E 



/ 



E(X 9l X hi . . . X g2M X h2 



g 1 h 1 ...g AI h M — g AI+ ih M+1 . . ■ g 2M h 2M 



For any of these terms, if some g £ E appears exactly once among the gi and hj, the expectation 
of the term will equal by the independence of the X g . Therefore we can sort the remaining 
terms based on the equalities between various gi and hj; namely, in correspondence with the set 
partitions of {1, ... , 4M} in which each component has size > 2. Let there be Cm of these. For 
a fixed partition A = (Ai, . . . , \ q ), we can majorize the sum 

E nx gi ...X 92M X hi ...X h2M ) < J2 E(|X 9i |^l)...E(LY 9 

. . .,g 2M , h lt ..., h 2M ) 6 A g lt ...,g q £E distinct 

9 v h t &E 

< V EX 2 ...EX* 

— Z / fl 9q 

= (£VarA^<(]TVarX g ) 2M 



g£E 



g&E 



since E\X g \P < \\X g \\^ 2 EX 2 < EX 2 for p > 2, *£ geE V&rX g > 1 and q < 2M. 



Thus E(\\(X * X) M \\%) < C M (E g eE Var ^ 



\2M 



□ 



Proof of Theorem 15.41 (Conclusion): Now by Holder's Inequality and the fact that i/j *Vj) M 
is supported on A 2MN i , 

\\fa*i>j) M \\i < ll(^*^) M ||2(#A 2ikfAr 1/2 < m*y j ) M hC(2MN j ) d / 2 . 
By Lemma [5T71 since Varr/ 3 < p(g,e) a , 

E(||(^ * vf ] ) M \\l) < Nf' I{a ~ d) ■ C M ( Var % ) 2 ^ < C d , a , M N 2M ^ 



and therefore by Chebyshev's Inequality, 

nil(^ H *^ H ) M ||i>A) < p(||(pf ) *^ a,) ) M ||iC 2 (2MiV j ) d >A 2 ) 

< C\- 2 M d Nf-E(\\(v!j u) *^ u) ) M \\h) 



< Cd,a,M A iVj 



2 , r 2Ma-(J(2M-l) 
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As a < d, take M,5 > such that d(2M - 1) > 2Ma + 5. Take A = j- M ( 1 + e ) ; s i nce N? is 
superpolynomial, ^ • j 2M ( 1+e ) NJ & < oo so by the Borel-Cantelli Lemma, there is a set 0,2 C f^i 
of probability 1 on which * < C w .T M(1+e) for all j and thus Y%Li IWj^Wlp < 

Cu Y2j^iJ ~ e < oo. This completes the proof of Theorem 15.41 

5.2.2 Proof of Theorem 1531 

By Theorem 15, 4[ for a; G we have a.e. convergence of A^ f for / G L 2 (X), which is dense 
in L 1 (X). We therefore need only a weak type maximal inequality to prove Theorem 15.51 As 
usual, it is enough to consider the dyadic subsequence 2 K Now for / > 0, < f < A^\ 1 f 
for 2 J < N < 2- ?+1 , so it suffices to prove 



suplA^/llkoo < CH/lli for all / G L\X). (5.9) 

3 



Again, we will use Lemma 15.61 to transfer this maximal inequality from fi-(G). Let 

tf\g) — > * 



1 0, ^A* 



\ 0, A 2J 

Hj and E/ij correspond to the operators and cr 2 j , respectively. Theorem 15.51 reduces to 
proving 

|| sup \lf * //^|||i,oo < Ca,||^||i. (5.10) 
3 

Proposition 5.8. Let fj,j and Vj be sequences of functions in ^(G), where G has polynomial 
growth of degree d. Let rj := #{g : fJ-j(g) ^ 0} and take Rj := inf{i? > : Vj{g) ^ =>• 
p(<?> e ) < R}- Assume there exists Co < oo such that ^j<fc r i — Co r fc f or a llk £ N, and that 

V j * Vj = 0{r-j X )b~ e + 0(Rj d ~ e ) for some e > 0. (5.11) 

If for all ^, || sup ip* \fij — Vj\ ||i jOC < C||<^||i and || sup |<£>*//j| \\ P ,oo < CplMlp / or some 1 < p < oo, 

3 

then 

|| sup \if * Hj\ ||i jtX) < C'H^lli for all ip £L l l (G). (5-12) 
j 

Proof. This is simply an extension of the proof of Theorem 13.21 however, we must first establish 
that the Calderon-Zygmund decomposition makes sense on more general groups G. Since word- 
length is a quasimetric on G, we can use the p-dyadic cubes constructed by Christ in [§] on spaces 
of homogeneous type. Namely, there exist a collection of subsets {Q s .k C G : s G N, k G Z}, and 
constants A > 1, cto > 0, Ci<oo such that 

Vs G N, G = \J k Q Sjk (5.13) 

r < s Q rjJ C Q Sife or Q r>i n = (5.14) 

V (r, I), Vs > r, G Z such that C/r,i C Q s ,k (5.15) 

Diameter Q s<k < dA s (5.16) 

Each Q S) k contains some ball of radius oqA s . (5-17) 



44 



Because G has a polynomial rate of growth, p is a doubling metric, and thus we can prove the 
Vitali Covering Lemma and the Hardy-Littlewood Maximal Inequality on G. Using a standard 
stopping-time argument, we can then define a suitable discrete C alder on- Zygmund decomposi- 
tion on G with the dyadic cubes. 

Fix A > 0. We take 99 = q + b, where ||fl||oo < A and b = ^2 b S:k for some index set B C N 2 , 

(s,fc)eB 

where b sk is supported on Q Sl ki {Qs,k '■ (s,k) £ B} is a disjoint collection, ||6 Sj fe||i < \#Q s ,k and 

#Q s ,fc < ^"IMIi (C independent of c/? and A). Let b s = b sk . 

(s,k)eB k 

We further decompose b S)k = b^l+B^, where b[ j) k = 6 a ,fcl(|6,, fc | > Xrj). Define b^P ,b® , 
by summing over one or both indices, respectively. 

We will divide B^ = Bg into two parts, splitting at the index s(j) := min{s : A s > Rj}. 

Now {g : supj \ip * fij(g)\ > 5A} CfiiU^U^U^U E s , where 

Ei = {g: sup \g*fj,j(g)\ > A} 
3 

E 2 = {g : sup \b^ * fJ,j(g)\ > A} 
3 

E 3 = {g: sup \B® * (fij - Vj )(g)\ > A} 

E 4 = { 9 :sup| ( jr B W * 1/^)1 > A} 

J V=s(j) / 
/*(?)-! \ 
B 5 = {<?:sup| Yl B * } > A} 

* \ s=o J 

By the weak (p,p) inequality (if p < 00), #E\ < CA~ p ||g||p < CA~ p ||fj||£r 1 ||fl||i < CA _1 ||y?||i; 
if p = 00, re-do the decomposition so that ||fl||oc < C^X instead; then E\ will be empty since 

|| SUPj |0 * MjHIoo < CoollflHoo < A. 

Next, 

#E 2 < Y,#{9 ■ \b U) * fij{g)\ > 0} < #( su pp Mi) • #{9 ■ Ug)\ > Xtj} 
3 3 

3 k>j 

= Y*{9-Xr k <\b(g)\<Xr k+1 }Y r 3 



< 



3<k 

— Y Xr k#{g ■ Xr k < \b(g)\ < Xr k+1 }; 



now note that this sum is a lower sum for |6|, and we have j^E 2 < Co A 1 \\b\\i < y \\f\\i- 

For E 3 , |BW * (/i, - ^)( 5 )| < |BW| * \ H - Vj \(g) < \b\ * \ H - Vj \(g), so by the weak (1,1) 
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inequality, 



#£ 3 < #{<? : sup \b\ * \n s - uj\(g) > A} < jplU < j]^. 



To bound E4, note that for all s > s(j), B^l * Uj is supported on Q* k := {g : p(g, Q S} k) < A s }, 



so 



(s,k)eB 

We have thus reduced the problem to obtaining a bound on the size of E$. 

Lemma 5.9. Let Bg be as above, and assume the Vj satisfy \5.11\) . For < s < s(j), 

\\B® * Vi\\l { G) < Crj l \\B^\\l + CXT*\\BV\ 

and for < s\ < S2 < s(j), 

\{B$ * Vj ,B^ * u,) £2(G) \ < C\2-v\\Bi%. 

Proof. We first restrict the supports of the B s ; we assume there is a Q s (j),k such that Q S: k C 
Qs(j),k for all (s,k) e B with s < s(j). Then H^Hi < ||6 s ||i < J2( s ,k)eB MQs,k\ < MQs(j),k \ ^ 
CXRlj, and thus 

\(BV>* Vji B$*Vj)\ = \(Bi{> *v j *9 j ,Bi{>}\ 

< CrJ 1 \(B^,Bi{>)\ + CR- d 2^^\B^\\ 1 \\Bil>\\ 1 

< Cr7 1 |(Bg^Bg))| + CA2-^'||Bg)|| 1 . 

Now this first term is if si ^ S2, and CrJ^WB^ ||| if si = S2- 

We remove the assumption on the supports by noting that if the distance between the sup- 
ports of (pi and if2 is greater than 2Rj, then (ipi * uj, if2 * Vj) = 0. Thus if we decompose each 
B s = Y2k-Bsl(Qs(j),k) an d decompose the inner products accordingly, all but finitely many of 
the terms (a number independent of j) will vanish; and those remaining can be estimated in 
this way. □ 

Now by Chebyshev's Inequality, 

s{j)-i 

A 2 #{ 5 :sup| £ B(j)*v j ( g )\ > \} < ]Tsup| Y, B^*u,(g)\ 2 
i s=0 g i s=0 

•(j)-l 

< Ell E Bi s 3) *^W 2 2 (5-18) 

j s=0 

< E E \( B $ * v 3, B n * v i)p{<3)\ 

0<S!,S2<s(j) 
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and this is 

<EE (Cr7 1 ||5W||l + (7A2-^||B(i)|| 1 )+2^ ]T C^H^Ha 

0<si<s 2 <s(i) 

oo oo s 0')-l 

s=0 j=l j s=0 

oo 

<E c *iii + E E ^7 1 ii^' ) ni- 

The first term is < CA||(^||i as desired. For the second term, note that 

oo 

E r i - C o r k for all & G N =>- 3N s.t. r j+n > 2rj for all j G N,n > iV =^> E 7 "/* - ^fc^ 
Since the Q Sj k are disjoint, for a fixed 5 G Q so ,k > 

E E ^W } G?)| 2 < E ^" 1 |^o(5)| 2 < CA|6 S0 ( 5 )| = CA|6( fl )| 

*rj > |6 S0 ( S )| 

so EjEto^^ll^lH < CA||6||i < CA||^||i and the proof of ([5TT2]) is complete. □ 

Having established Proposition 15.81 it remains to show that the random measures /Jij and z/j 
satisfy the assumptions with probability 1. Note first that rj = |supp (J>j \ = E 36 a 2J ^s( w ) ~ 
2 < y d ~ a )i on fii, and i/^ is supported on A 2J with p-diameter at most Rj = 2 J+1 . We must prove 
the bound (|5.1ip on Uj * . 

Lemma 5.10. Let G be a group and E a finite subset. Let {X g } g£ E be independent random 
variables with \X g \ < 1 and ^X g = 0. Assume that ^2 g£E (Var X g ) 2 > 1. Let X be the random 
(^{G) function ^2 geE X g 5 g . Let G x denote G \ {e}. Then for any 9 > 0, 

|X * X||^x> (G x) > 6C^2(VsLrX g ) 2 ) 1/2 J < 6|£| 2 max(e- e2 / 36 ,e- e / 6 ). (5.19) 
Proof. For /i/e, 

x*x(/o= E E y 5 

g£Enh~ 1 E g£EDh- 1 E 

where KY g = and |Y 9 | < 1. We want to apply Chernoff's Inequality, but the Y g are not inde- 
pendent. 

We can, however, partition E n h~ 1 E into at most three subsets E\ , E2 , E3 , in each of which 
the 1^ are independent. To see this, note that we can make a directed graph with vertex set E 
and edge set {(g,hg) : g,hg £ E}; and that the components of this graph are paths or cycles. 
Thus we can three-color this graph; and within each resulting Ei, the Y g depend on distinct 
independent random variables, so they are independent. 
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Now Y g has variance 

g£Ei 

<J 2 =Y1 VarX g VarX gh < ^(VarX ff ) 2 < ^(VarX 9 ) 2 

g&Ei g&Ei gdE 

by Holder's Inequality. Chernoff's Inequality (Theorem 1.8 in [21]) gives us 

F(| Yl Y a\ ^ AfJ ) ^ 2max(e- A2/4 ,e- ACT/2 ). 

Take A = ^ _1 (E fl6 £;( Var X s) 2 ) 1/2 5 then A > 6> and Act = #(£ g6£ (Var Xg) 2 ) 1 / 2 > (9, so 

3 

P(|X*X(/t)| > 3^(^(VarX s ) 2 ) 1 / 2 ) < - AcJ ) - 6max(e^ 2 / 4 , e~ e / 2 ). 

i=l Ei 

Since this holds for each /i/e and |supp X * X\ < \E\ 2 , the conclusion follows (after replacing 
36* with 6). □ 

Corollary 5.11. Let z/j 6e f/ie random measure defined as before, < a < d/2 and k > 0. 
TTien i/iere is a set ^3 C ^2 with P(^3 = 1) such that for each oj € ^3, 

i/H * z>j w) = O^-^Se + O w (2 2 ( Q -^( E^) 1 / 2 2 KJ '). (5.20) 

9 6A 2i 

Proof. For the bound at the identity e, we use the fact that 



< 2 2 ( a - d ^' (E^ + e 9 H) = 2( Q -^' +1 + 2 2 ( Q -^' J7 fl (w) 



so that 

P^H * j)f\e) > 3 • 2^'^) < P( ^ ? ?9 (w) > 2^^') < 2exp(--2^ a ^') 



for j sufficiently large, by Chernoff's inequality. The Borel-Cantelli Lemma then implies that 



M * r.M, 



V 3 * V 3 



e) < 3 • 2^ a ^ for j sufficiently large (depending on u>), so there exists C w with 



< uf' * vf\e) < C^-^i for all j. 
For the other term, we note that Varry 9 < E£ g , so we set = 2 KJ and apply Lemma l5.10t 

2^ d -^\\vfUvf ) \\^ {G , ) > 2 K \ E ^ 2 ) V2 ) ^ C2 2 *'exp(-2 K V2) 



which sum over j. The Borel-Cantelli Lemma again proves the bound holds with probability 
1. □ 
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Note that ^ E Zg < ^ {d ~ 2a)i \ thus for a < d/2, 



2J 



1/2 

2 2(a-d)i | V Eg I 2 Kj < C2^ +a+K ^ = CRj d 2^i +a+ ^ j 



and thus for k chosen small, the measures i/j satisfy the bound (|5.1ip for all w £ O3. Since 
Hj — = Efij is a weighted average of the nonnegative averages in (|5.4p . Theorem K2 
implies || sup^ \tp * E/ij|||i )00 < (7||y>||i; and the maximal inequality for fj,j is trivial. Thus 



Proposition 15.81 applies, and we have proved Theorem 15.5 

Remark: As in the case of Z d , this method is inherently limited to exponents a < d/2, because 
otherwise the set is too sparse for a single convolution product to be "uniformly" small in any 
nontrivial sense. 

5.3 Gaps and Banach Density 

In [15] it was noted that, with probability 1, the sparse random sequences in N have Banach 
density 0, which distinguishes them from block sequences of the Bellow-Losert type in Section 
[2j This remains true for the random subset {g G G : £ g (u>) = 1} which we have obtained. 

It is worth noting a second distinction: with probability 1, this random set has a subset with 
gaps tending to infinity which is of full relative measure, and thus the averages over this subset 
still converge a.e. for functions in L . (Note that this is not a necessary consequence of Banach 
density 0: consider the "Cantor set" of natural numbers that can be written as finite sums of 
distinct powers of 3. This set has Banach density 0, but no set of positive relative measure can 
have gaps tending to 00.) 

We will order the random set {g n } = {g £ G : = 1} so that p(g n ,e) is nondecreas- 

ing. The convergence of the ergodic averages in (|5.5p implies that we can add the points 
one at a time and maintain the pointwise ergodic theorem; that is, the averages 



71=1 

converge a.e. in X for any measure-preserving group action (X,T). 
Definition For j > and M < 00, let 

:= {n € [2 J ,2^ +1 ) : inf p(g n ,g m ) < M} (5.21) 
Pt,M ■= 2-j\T t , M \. (5.22) 

Proposition 5.12. Let {g n } be a sequence in a virtually nilpotent discrete group G, such that 
p(g n ,e) is nondecreasing and the averages A^ n ^ f converge a.e. for all f 6 L l (X). If Pj,M < 

3 

00 for every M < 00, then there exists an increasing sequence {ra^} G N such that {g nk } has 
gaps tending to infinity, and such that the averages A^" k ^ f converge a.e. for all f G L X (X). 
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Proof. We can clearly choose a sequence Mj with Mj — > oo so that ^ ■ /3j,Mj < oo. Let {n/t} be 
the set N \ (|J • Tj t Mj), taken in increasing order, and note that ^ — > 1. 

Considering as the union of terms in {n^} and the complement, we see 



< 



n<riK- 



n K -K 



Clearly the L 1 norm of the right-hand side is bounded by — ^ 
more, we have the weak maximal inequality 



as K — > oo. Further- 



sup 

K 



^ E f( r (9, 



U, rj,Mj 



< 



sup 

J 



2" J E fVignM 

n<' 

^ E^. M i <o °- 



n<2-?: 
3 r j,^ 



I (9n h ) 



Therefore A^" k ' f — ^jf-An^ f — )■ a.e. and since ^ — > 1, this proves that A^ nk ' f converge 
a.e. for all / € L l {X). □ 

It remains to show that our randomly generated sequences indeed have this property. If B is 
the ball of radius 2M centered at g, and p(g, e) ~ 2 l 3> M, then clearly 



,(SnJ 



(5.23) 



If ,6; is a cover of {g £ G : p(g, e) ~ 2'} by balls of radius 2M, with multiplicity of intersection 
controlled uniformly in I, then for j = (d — 7)/, 



|B| /|R|\ 
/3„m < 2-^|E(*-l) ' ' 2 



f=2 
2jl-j 



< C M \Bi\T 

< C M 2 l(d - 2 ^- j < C M 2-~< 1 , 



and these are summable. 



Remark: Let {n^} be one of the sparse deterministic sequences from Section [J] (either speckled 
or plaid); because of the nature of the Freiman isomorphism F p in (|4.13p and the pseudoran- 
domness of the points (j,j 2 , ■ ■ ■ ,j m ) £ Z™, it becomes vanishingly rare for two points in the kth 
block to be within of each other, and thus the Pj,M are summable as well. Therefore these 
sequences can be modified in a negligible manner so as to have gaps tending to infinity. 
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